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NOMBRES EXPONENTIELS ET 
NOMBRES DE BERNOULLI 


JACQUES TOUCHARD 


Introduction. Les nombres entiers positifs a<>, oi, 
la fonction g6n£ratrice 



a„, . . . d£finis par 


et que Ton appelle nombres exponentiels jouent, pour la sommation de cer- 
taines series, un r61e qui rappelle le r61e sommatoire des nombres de Bernoulli. 
Nous avons rassembI6 ici les principales propri6t6s des nombres a n dont 
plusieurs sont, croyons nous, nouvelles. L’une d’elles qui se pr£sente spon- 
tan£ment, comme on le verra, est l’existence d’une orthogonalit6 symbolique 
par rapport & ces nombres. C’est ce qui nous a conduit k rechercher si Ton 
pouvait former une suite de polyndmes orthogonaux symboliquement par 
rapport aux nombres de Bernoulli. On y parvient en effet, mais beaucoup 
moins facilement, grkce k une fraction continue donn£e par Stieltjes. 

Nous 6tudions aussi des polynomes exponentiels. 

Nous ferons constamment usage du calcul symbolique, appel6 calcul de 
Blissard. La plupart des demonstrations sont si simples qu’il suffira le plus 
souvent de les esquisser. 


Propri£t£s des nombres exponentiels 
1. En diff£rentiant la formule symbolique 

(1) e**" 1 = e at , 
on obtient 

(2) fl„ +1 = (a + 1)* 
et, en posant dans (1), e* — 1 = u, on trouve 

(3) a(a — 1) ... (a — n -f* 1) = 1. 


Plus g6n£ralement, f{u) d6signant un polyndme quelconque, 


(4) 


/(a + 1) = a /(a), 

f(a + p)=a(a-l)...(a-p+ 1 )/(a). 


L’expression de/(«) par la formule de Newton donne, en vertu de (3) 

(5) 


m =/< o) + ^ + . . . 
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et aussi 


}(a + x) =/«+^P + ^P + ..., 

les differences 6tant prises pour la suite des valeurs x, x + 1, x 2, . . . de la 
variable. En particulier 


A0 P , AV , , A P 0 P 

a p = ~7~ + ~7T7~ + • • • 4* 


( 6 ) 


1 


a,+i = l p + 


2! 

Al p A 2 1 p 


>i 


V\V 


+ . . • + 


A 1 


P > 1, 
P > 0. 


1 * 2! p\ * 

Dans (5), substituons les expressions des differences successives, savoir 

A’/(0) = /(£) - (f) f(p - 1) + (£)f(p - 2) - , 

admettons ensuite que le polynSme f{u) soit de degre < n et posons 


nous obtiendrons 


. X , 1,1 1 , , (-1)" 

w{n) = l- I + -- - + ... + — r 




f(a) = w(n)f{ 0) + . . . + w(n - i ) J — r -f . . . + w(0) 


1! 


J(n) 

n\ 


et, en faisant grandir n indefiniment, 


”o n\ 


On voit d’ailleurs directement, en multipliant les deux membres de (1) par 
e xt et en developpant que 


e{a + x) n = 


(* + i) n 


iTo l\ 

et que, /( m) designant un polyndme quelconque, 

(7) ./(« + *)- 

n-0 n\ 

C’est 15. la propriete sommatoire des nombres exponentiels. En particulier 

n > 0, 


( 8 ) 


(\n q» 

e a n+ 1 = 1" + J + 2 ! + • • • , 


6 do 


= Eh 


o n) 


Les valeurs suivantes sont empruntees k Bell (4). Le calcul a ete poursuivi 
par Becker (1) jusqu’5 l’indice n = 35 et par Miksa (6a, p. 54) jusqu’5 l’indice 
n — 51. 
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« 

0 

1 

2 

3 

4 

5 


a» 

1 

1 

2 

5 

15 

52 


n a n 

6 203 

7 877 

8 4140 

9 21147 

10 115975 


2. Remplagons maintenant, dans liquation (5 ),/(x) par A n f(x) et faisons 
usage, au premier membre, de la formule (4) pour p = 0, 1, 2, . . . , n, nous 
aurons symboliquement 


h n (a)f(a) = A B /(0) + 


A’+VtO) A" + V( 0) 

1! 2! 


+ • • 




Oil 

(9) h n (x ) = X (~1) \ ) *(* — 1) ...(* — » + * + 1). 

(-0 \ t / 


On en d6duit que 

( 10 ) 



0 < p < n, 
P — n. 


Gr&ce k ces formules on peut, connaissant a 0 , ai, a 2 , . . . , a„_i et le seul poly- 
n6me h n (x) t calculer a„, a B+ i, . . . , a 2 *. On voit de plus que les polyndmes 
h n (x) jouissent de la belle propri6t6 d’etre orthogonaux symboliquement par 
rapport aux nombres a n : 


(ID 


h m (a ) h n (a) 



m 7* ti, 

m — n. 


D’apr&s (9), la fonction g6n£ratrice des polyndmes h n (x) est 


(1 + z) x e ' = £ h n(x) 


n«= 0 


n\ 


En faisant, dans (7), x = 0 et f(u) = h m (u)h n (u), on voit, d’apr^s (11), 
que 

h m (k ) h n (k) _ /o, m n, 

kZ o k\ [eml, m — n. 

C’est 1& un r6sultat qui se rattache k la th£orie des polyn6mes orthogonaux 
de Charlier-Poisson (10): 

ho — 1, 

hi = x — 1 , 

hi = x 2 — 3 x + 1, 

Zt 3 = x 3 — 6 x 2 + 8 x — 1, 

h t = x* - 10 x 3 + 29 x 2 - 24x + 1, 

ho — x 6 — 15 x 4 + 75 x 3 — 145 x 2 + 89 x — 1, 

ho — x® 21 x 6 + 160 x 4 - 545 x 3 + 814 x 2 - 415 x + 1. 
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3. Les propri6t6s exposdes dans les deux paragraphes pr6c£dents sont 
fondamentales. En voici d’autres. 

Dans (1), posons e * = 1/(1 — jc) et divisons par (1 — x ), nous aurons 

(i - *r i e x,a ~ x) = (i - x)-*- 1 . 

Le premier membre se d£veloppe au moyen des polyndmes de Laguerre, 
d’oa 

(12) . (a + l)(o + 2)...(o + «) = E (”}«!. 

Cette formule r6sulte aussi de la relation 

(a + x)(a + x — 1) . . . (a + x — n + 1) = 2 ( ^ )*(* — 1) . • • (x — n+1). 

<-o \ i / 

Dans (1), posons e* = (1 + x) 2 , nous aurons 


2 z+z* 


= (1 + *)**. 


Le premier membre se d 6 veloppe au moyen des polyndmes d’Hermite, d’oil 


2a(2a — 1 ) . . . ( 2 a — n + 1 ) = n! J*. ^ 77 ^ 


n-2 i 


"f i\ (n — 2 i) \ * 

la somme s’ 6 tendant aux valeurs de i depuis z£ro jusqu’& Tender de \n. 
Faisons maintenant, dans (1), e* = (1 — x)*, il vient 

(13) e u-x)i-i = (i _ x )i\ 

Si Ton pose, pour un moment, 

(14) y(x) 

la fonction y(x) satisfait & liquation diff£rentielle 

(4 - 4x) y"(x) = 2 y' (x) + y{x), 
d’oil se tire la recurrence 


v£»*_ 
e ~ -V 2” «! ' 


(15) 


Cn+2 — (2 It ~f* 1) C n + 1 C n , 

0 

II 

1 

c 6 = 329, 

Ci - 1, 

ci = 3655, 

Ci = 0 , 

c 8 = 47844, 

c 3 = 1 , 

c 9 = 721315, 

Ci = 5, 

cio = 12310199, 

c 6 = 36, 

Cn = 234615096, 


et l’on voit, d’apr£s (15), qu’& partir de n = 4 les nombres c n sont les d£nomi- 
nateurs des r 6 duites de la fraction continue 


11 

'5 + 


II 1| 

+ |9 +|Tl + 
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qui est une fraction continue de Gauss. On en d6duit l’expression de c n+ • 
pour n > 0 


Cn+i 


(16) 


- 6.7-9... (2» + S)[ 1 + (lW 

(” - ')— 

\ 2 /5- 7- 


(2n + 5) 


4- 


+ 


(2w + 3)-(2» + 5) 

1 

\ 3 Z5-7-9- 


(2« + 1)(2« + 3)(2n 4- 5) 
Les formules (13) et (14) donnent alors 


+ 


• • • j • 


(17) 


a(a — 2) (o — 4) ... (a — 2& + 2) = ( — 1)* V* 


Par analogic avec la formule (3), consid£rons l’int6grale 

-. (” !)• . ..■■■ = f (1 - x) a x-'dx. 

(a -f l)(a 4 2) . . . (d 4 w) Jo 

Comme on a symboliquement 

(i - x) a - 

cette 6galit6 devient 

~ l)! = f«-*j <T l dx = P{n), 

I Jo 


^ ^ (a 4 1) (fl 4 2) . . . (a 4 w) 

P(x) d6signant la fonction bien connue de Prym. On a done 


1 


(a 4 l)(a 4 2) ... (a 4 w) 
ou bien, d’apr£s une propri6t6 connue de P(z), 

e 


m = i — 1 

r(») * L 1 2! *•* (n-l)U 


= E 


i 


(a 4* l)(a + 2) . . . (a 4* n) i^i T(n 4" i) ' 

ce qui s’accorde avec la formule (7), quand on y fait x — 0 et /(«) = r(tt+l)/ 
T(w + « + 1). II est facile de d6montrer que si Ton d6veloppe le premier 
membre de (18) suivant les puissances positives de a et si Ton somme, par la 
mdthode de Borel, la s6rie divergente obtenue en remplagant a n par a n , on 
obtientP(w). 

Consid6rons enfin les nombres 


Qn = CLnP-0 


— ^ d n —\CL\ 4- a n-&2 — ... 4" (-1)’ 


doOn 


ou, symboliquement, 

(19) q K = (a — a')*- 

Ces nombres q n sont les invariants quadratiques des formes binaires (x 4- fly)*. 
Ceux d’indice impair sont nuls. D’apr^s (19), leur fonction g6n£ratrice est 
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(20) 


„«*+«-*— 2 




2_ 

n\ 




et, en diff6rentiant cette Equation, on obtient la recurrence tr&s simple 

q n+1 = (q + l) n - ( 2 - l) n , 

qui donne 

qo =1, qs = 3614, 

52 = 2, 510 = 99302, 

g 4 = 14, qn = 3554894, 

q< = 182, ?14 = 159175382. 

Dans (20), changeons 2 en 2 iz, remarquons que 

e 2i! + e -2< * — 2 = — 4 sin 2 2 

et posons sin 2 = \u y nous ob tenons, puisque qi n +i = 0, l’6galit6 symbolique 


U 2 ( 0 . u\ 

e = cosl 2 q arcsin • 


(21) 

Or on sait que 

( n . u\ , m 2 2 , m 2 (m 2 - l 2 ) 4 

cosl 2 m arcsin - 1 = 1 — u + m — . . . . 


Le d6veloppement des deux membres de (21) conduit done k la formule 

(22) q 2 (q 2 - l 2 )(q 2 - 2 2 ) . . . [q 2 - (k - 1)*] = . 

Les nombres q n jouissent d’une propri6t6 sommatoire que nous 6tablirons 
plus loin. 


4. Une d6finition int£ressante des nombres exponentiels a n a 6t6 donn6e 
par Broggi (5) au moyen de la s6rie asymptotique 

(23) /(*) = e~' f V~‘ e'dl = £ . R(x) > 0 

•JO i=0 X 


qu’on obtient en d6veloppant d’abord e\ en integrant terme 5 terme, en 
d6veloppant ensuite les fractions suivant les puissances de \/x et en ayant 
6gard aux formules (8). Or la fonction I ( x ) peut etre repr6sent6e par la fraction 
continue 


I(x) = 


1 

x -f 1 


1 | 2 | 3 

\x 2 \x + 3 [# + 4 


En d6signant les r6duites par a n //3„, on a 


ao _ 0 oj _ 1 

0o “ 1 ’ 0i “ * + 1 ’ * * * 
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et, en general, a „ est un polyn6me de degr6 n — 1, /3„ un polynSme de degre 
n, qui satisfont tous les deux k liquation aux differences 

«n = (x + n) i - in - 1) 2 . 

Partant de cette relation et des valeurs initiales j9 0 = 1, 0i = x + 1, un 
calcul facile montre que 

E &(*)£- a 

o n\ 

d’ou se tire l’expression 

Pn(x) = (- 1 )* 


A„(x) etant le polynbme (9). II r£sulte alors de la th£orie des fractions con- 
tinues que 


«»(*) _ Oo _ ai 
p n (x) X ~x 


Q2w-1 , 
x 2n 



/?„(l/x) d^signant une s6rie en 1/x, et les propri£t6s d’orthogonalite symboli- 
que des polynbmes h n (x), que nous avons rencontres au §2, sont, comme il est 
ais£ de le voir, une consequence immediate de cette formule. Nous utiliserons 
cette remarque plus loin. 


5. Epstein (5) a consider la fonction entire de 5 
(24) = p + + + 


ig(-n) = ea n +i, 

U(l) =6-1 


On a evidemment 
(25) 

et, d’apr^s une integrale euierienne classique, 


n > 0, 


n > 1 


formule que nous general iserons plus loin. Epstein a calcuie les valeurs de 
e~ l g(n), pour n entier positif jusqu’5 n = 21, mais, sauf pour n — 0 et n = 1, 
ces valeurs ne paraissent pas pouvoir s’exprimer k l’aide de nombres connus. 
En posant g{n ) = g n , on verifie sans peine la relation symbolique 

(1 - g)(l - 2g) ... (1 - ng) 

(» + !)!(« + 2) B ^ 2! (« + 3)" ^ ” * 


qui presente une certaine analogic avec (3) et constitue une formule de 
recurrence approchee des nombres g„. 
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PolynOmes EXPONENTIELS 

6. Les polyndmes <£„(*) dont nous d6sirons nous occuper et qui se pr£sentent 
souvent en analyse sont d6finis par la fonction g6n6ratrice 

(27) 

n -0 n\ 

0o - 1, 

0 1 = X, 

0 2 = X 2 + X , 

03 = X 3 -f 3 X 2 + X, 

0 4 = * 4 + 6 x 3 + 7 x 2 + 

05 = x 6 + 10 x 4 + 25 X 3 + 15 X 2 + x. 

On peut aussi les d6finir par le d6veloppement asymptotique 

e 1 f e**t M l dt = - <j>o(x) — ~2 0i(*) + . . • + ■“n+r - 0n(*) + . . . . 

J 0 2 2 Z 

La plupart des propri6t6s des nombres a n peuvent $tre 6tendues aux poly- 
ndmes 0„. Enum6rons les principales: 

0n(l) = &n> 

<t>n+l(x) = X(<f> -f 1)", 

(28) 0n+l(*) = X(0 n + 00, 

0n = (0 + 1)” — 0n» 

0(0 — 1) . . . (0 — fl + 1) = x n . 

Si on pose j n (x) — x(x — 1) . . . (x — n + 1). cette formule s’6crit symboli- 
quement j„(0) = x* et Ton a aussi <f> n {j ) — x”. Les polynOmes j % et <f> n sont 
done inverses les uns des autres. 

Nous avons ensuite 

*.(*) - 0* + Y AO" + a ! o” + . . . + ^ A"0", 

•’'*>(*) = St?. 

*-o 5! 

e*<f>n+i(x) = xit (s + l) n , n > 0. 

»-o 5; 

Plus g6n6ralement, f(u ) d£signant un polyndme quelconque de degr6 «, 


on a 




*/(0 + 1) 

= /(0), 

(29) 

*”/(0 + p ) 

— 0(0 ~ 1) • • • (0 — P + l)/(0), 


/(0) 

- m + f Af(0) + | A ! /(0) + . . . + A*/(0) 

(30) 

«*/(0 + k) 

£ **/(* + *) 
j— o 5 ! 
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Cette derni£re formule, oil k est un nombre quelconque, exprime la propri6t6 
sommatoire des polyndmes <f> n . Liquation (28) est 6quivalente k 

e dr. || 

En utilisant cette relation et en integrant plusieurs fois par parties le second 
membre de (26), on obtient, k 6tant un entier positif ou nul, 

} ~ (» + *)! Jo t 6 log Ut ' H> k ' 

qui est un prolongement de l’int6grale (26) pour des valeurs negatives de n. 

7. En raisonnant comme au §2 et en s’appuyant sur la formule (29), on 
verra que les polyn6mes 

H n (x, z) *= z(z — 1 ) ...(« — » 4 - 1 ) — x z(z — 1 ) . . . (z — n 4 - 2 ) 

+ ( 2 — 1) • • • ( z ~ n + 3) — . . . + (*-l)V 

satisfont aux relations symboliques 

H n {x, <f>) = 0 , 

<f>) = 0 , 


4r l H n (x, <f>) = 0, 

<t> n H n (x, <t>) = n\x n 


et que, par consequent, ces polyn6mes sont orthogonaux symboliquement par 
rapport aux polyndmes <t> n (x). En faisant usage de la formule (30) pour k = 0 
et/(«) = H m (x, u) ‘H n (x, u) on aura 

yy x*H m (x, s) H n (x, s) __ )0, m 7 * n, 

£o s! Ira! x m e x , m = n. 


On observera que ce sont 1& de pures identity et que Ton peut y remplacer 
les diverses puissances de x par des nombres arbitraires. 

Notons encore que, par analogie avec les nombres q n du §3, on peut consid£- 
rer les polyndmes definis par l’£galit6 


Xn(x) = 4>n<> 0 — 






<t>n-2<t>2 — + ( — I)* (f> 0 <t>n- 


On d6montrera comme au §3 la belle relation symbolique 

x(x - l 2 )(x* - 2 ! ) . . . [x 2 - (k - 1) ! ] = ^x*. 
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Propri£tes ARITHMETIQUES 

8. Les nombres exponentiels satisfont k diverses congruences dont les plus 
simples se tirent de (2), (3), (4) et de la congruence identique de Lagrange 

(31) x(x — 1) ... (x — p 1) = x p — x (mod p) 

oil p d6signe, comme dans tout le reste de ce paragraphe, un nombre premier. 
Voici quelques unes de ces congruences 

(32) a P = 2, a p+ i = 3, a p v +h = v a h + a h + 1 (mod p), 

et notamment . , . 

a p v = v + 1 (mod p) 

relation qui montre que la suite des restes (mod p) de a x , a p , a p t, . . . admet 
la p6riode p. 

D’apr&s les formules (4) et (12) on a symboliquement 
[a(a — 1) ... (a — p + l)] 2 = (a + 1) (a + 2) . . . (a + p) = 1 + p\ (mod p 2 ) 
or, d’apr^s le th6or£me de Wilson 

p\ -f- p = 0 (mod p 2 ) 

done 

(33) [a(a — 1) ... (a — p + l)] 2 = 1 — p (mod p 2 ). 

D’autre part, en vertu de (31), 

[x{x — 1) . . . {x — p + l)] 2 — 2x{x — 1) . . . {x — p + lXac” — x) 

+ (jc” — x) 2 = 0 (mod p 2 ). 

En remplagant x* par a k et en transformant le second terme k l’aide de (4), 
on obtient 

[a(a — 1) ... (a — p + l)] 2 — 2 (a p — aq — p) + a 2p — 2a P+i + a 2 s 0 

(mod p*) t 

et en comparant k (33) on trouve la relation 

a 2p — 2 a p+ i — 2a p ~i~p + 5 = 0 (mod p 2 ). 

Une autre congruence concerne les nombres c k du §3 et se tire de la formule 
(17). On a 6videmment, lorsque p est un nombre premier impair, 

x(x — 2) (x — 4) ... (x — 2p 2) = x p — x (mod p) 

et, par consequent, d’apr^s (17) et (32) 

c p = 1 (mod p), p premier impair. 

On a ensuite, d’apr£s (15), 

c p + 2 — c p+ i = 1 (mod p). 

Consid6rons maintenant les nombres q n du §3 et supposons encore p premier 
impair. On sait que 

(34) x(x — 1 2 )(* — 2 2 ) . . . ^ J — ^ 1(P+1) — x (mod p). 
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D’autre part, dans la formule (22), faisons k = \{p + 1); Ie second membre 
devient divisible par p. Done, en faisant dans (34) x = q 2 symboliquement, 
on obtient 

q P +i — Q 2 = 0 (mod p), 

ou bien 

(35) Qp+i = 2 (mod p), p premier impair. 

Maintenant, on a aussi identiquement 

(* - 1*)(* - 2 2 ) ...[*-(/> - 1)’] - - 1)’ (mod p)-, 

d’oCk 

(36) x(x — l 2 ) . . . [x — (p — l) 2 ] s y? — 2x* (p+1) + x (mod p); 

mais si dans (22) on fait k = p le second membre est divisible par p. On a 
done, en rempla$ant dans (36) x par q 2 symboliquement 

?2 v — 2 Qp+i + = 0 (mod p), 

et, en vertu de (35) 

(37) q 2 p = 2 (mod p ), p premier impair. 

En comparant (35) et (37) on voit que, si p et 2p — 1 sont tous deux premiers 
impairs, on a 

qi v = 2 (mod p(2p — l)) f 

congruence que Ton peut verifier pour p = 3 et p = 7 et qui a lieu aussi 
pour p — 2. 

Les polyn6mes 4> n (x) satisfont aussi 5 diverses congruences, soit qu’on 
consid£re x comme un nombre entier, soit plut6t qu’on consktere x comme 
une ind6termin£e, racine d’une congruence irr£ductible (mod />), c’est-&-dire 
comme une imaginaire de Galois. Nous renverrons h ce sujet 5 ( 12 ). 


Integrales d£finies 


9. On peut de plusieurs manteres obtenir l’expression des nombres a„, des 
polyn6mes <£„ et de la fonction g(s) par des integrates d£finies. En voici quelques 
unes. 

L’int6grale euterienne de deuxteme esp£ce donne d’abord 

r(5) g(s) = J Vv"V~\fcc, R(s) > 0 


et on en tire par un proc£d£ connu la formule, valable dans tout le plan s, 


g (1 - s) 
I'M 



oii C d6signe un lacet partant de — <» avec l’argument —t pour z et y revenant 
avec l’argument +?r, apr&s avoir entour£ 1’origine. 


| 
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En partant d’une formule, dtie h Laplace, 


X 


+ °° e iu *dz 


(1 + ^ 


2t k~ 1 -u 

u e , 


et oil k est positif, on obtient h l’aide de (8), apr&s quelques calculs 


_ /»0O CO B Z 

TTOn_ _ I e 

n) Jo 


r(n) 


'o (1 + 3 ) 


yrjH cos [2 + e sin 2 — n arctg 2 ] dz, 


u > 0, 
u < 0, 

n > 1. 


10. si r on consid£re la distribution de Poisson 


(38) 


da(x, t ) = 


e X x t 


t\ 


t = 0, 1,2,3, . . 


c’est-^-dire la “step-function” ayant le saut da(x,t ) aux points t = 0, 1,2, 
3, . . . , on obtient l’intdgrale de Stieltjes 


x(e’— 


et, par suite, 


l} = PV *</«(*,/) 

Jo 


<t>n(x ) = f t n da(x,t), 

Jo 


n > 0. 


Les polyndmes <t> n (x ) sont done les moments de la distribution (38). On a 
aussi, pour toute valeur de s, 


t 1 s da( 1, t) - e x g(j). 


11. Mais l’expression la plus int6ressante des nombres a n parait etre celle 
qui se d6duit de la formule ( 11 ) 


(39) 


p xdz p 

Jo r(l -f- 2 ) Jo z(tt 


e~ xt dz 


(/ -f log 2 2 ) ’ 


R{x) > 0. 


-f. 


JL f 00 e *r(n) sin {w arctg(7r/log 2 ) } dz 

t Jo (tt 2 4- log 2 2 ) |n 


En remplagant x par e x , en diff6rentiant n fois et en tenant compte de (6), on 
obtient 

z n dz 

f 0 T(1 + 2 ) 

Equation qui est exacte m6me pour n — 0. 

Dans la formule (39), remplagons x par x e { et d6veloppons suivant les 
puissances de /, nous obtiendrons 

* . f x p xVds p e~«<t> n (-u)du 
n Jo r(l -f- 2 ) Jo u{tc“ -f- log* {x / 11 ) \ ’ 

Faisons dans cette Equation, successivement n = 0, 1, 2, . . . , p, rappelons 
nous que la factorielle j p (x) du §6 satisfait h la relation synbolique j p {4>) — x v 
et nous trouverons 

00 ~ t+9 dz 


R(x) > 0, n > 0. 


iv _ f ” X t+ *dz . . , P__£V 

J_ p r(i + 2 ) + ( ^ Jo «{/ + ! 


v du 
7 


log (pc/u)) 
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NOMBRES EXPONENTIELS A PLUSIEURS ARGUMENTS 


12. Nous indiquerons rapidement une generalisation des nombres expo- 
nentiels a n . Si on pose 

a n (a), o>') — (co<i -}■ o)'a') n = \ n )a ) n ~ i io ,i a n -i at, 

<-0 \ i / 

w et cd' etant deux constantes, la fonction generatrice des nombres u') sera 

e e ~ e e =2 a »("» «') ~7 

n-0 

et, en multipliant les deux membres de cette egalite par e** et developpant 
suivant les puissances de z, on verra que 


OO CO 


E v»/(r + ww + W) 2 it i /• /m 

2. = «/l* + « )1 

W! W! 


(40) 

m— 0 it— 0 

oil /(w) designe un polyndme arbitraire. On peut ainsi former des nombres a n 

dependant d’un nombre quelconque d’arguments w, o', w" En particular, 

pour fa) = 1, fa)' = — 1, on aura 

a*(l, -1) = q kf 

q k designant les nombres definis par (19) au §3 et, par suite 


t t fix+ J n 7 n) -^ + i) 

m— 0 n— 0 ttl . tl . 


m— 0 n- 

symboliquement. C’est la propriete sommatoire des nombres q n . On a notam- 
ment 


(m — n) 


2k 




m— 0 n— 0 HI. tl. 

On peut generaliser d’une maniere analogue les polyn6mes 4>„(x). En parti- 
culier, on verra que 

oo oo m-f n / \k 

Y' Y' ^ W y 2x 

^=0 m! «! 

oil Xn(x) est le polynSme introduit au §7. 


= e x X k(x) 


Nombres de Bernoulli 

13. Les nombres de Bernoulli b 0 , bi, bz , . . . sont definis par 

-T^-r = E ^T- . 6.-1. 6. = - 1. 6. - 5. ... i 6..+. = 

6 — I 0 «! 


0, n > 1, 


et nous cherchons une suite de polyn6mes Q n (x) orthogonaux symboliquement 
par rapport k ces nombres, c’est-ft-dire tels que Ton ait symboliquement 


(41) 

K n etant une constante. 


6 ’o.m = ft ,, 0 * 


I 


I 
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II est n6cessaire pour cela, comme nous l’avons remarqu6 au §4, de savoir 
mettre sous forme de fraction continue d’un type appropri6 (7, chap. 9) la 
s£rie asymptotique 


(42) 


*(* + D = ^ + ^ + ^i + .... 


XXX 

Les polynfimes Q n (x) seront alors les d£nominateurs des r6duites successives. 
On sait par la th£orie de la fonction r que 


(43) 


1 


, / , d logr(x +1) 

+ {x + 1) = d? = SoT+^j 5 ' 


et il se trouve que Ton d6duit imm6diatement d’un r£sultat de Stieltjes (9, 
p. 378) la fraction continue ayant le type qui convient 

2 I Xi I X2 I A3 I 
Hx + 1) = fo.. ' , J + i n" , J + f— ' T~ J + rn - , J + . . 


2x + 1 


oil 


2x -f 1 

An = 


\2x + 1 ' \2x + 1 


n 


4b - 1 ’ 


Les polyndmes Q n (x) v6rifient done la formule de recurrence 


(44) 


n 


Q h +i(x) = (2x + 1) Q n (x) + 4 x 


Qn-l(.x) 


k l’aide de laquelle nous avons calcul6: 

<2o = 1, 

Qi - 2x 4- 1, 

Qt = 4^ac 2 + * + ^ , 

@ 3 = 4^2x s + 3 x 2 + x + , 

Q, = 16^x 4 + 2x’ + yl’ + y* + H) ■ 

Qi - 16 ( 2 ** + 5x 4 + f *' + f *’ + fty* + f[) . 

<?. = 64(1* + 3x‘ + jy x‘ + yy X 1 + yj X + , 

Qi = 64(2*' + 7 *’ + ff + 

Q. = 256^x 8 + 4 x 7 + ^x* + ^x 8 + ^|yx 4 + ~x' 


6559 , , 4949 , , 198 
~m x ~\A3 X +1F 


, 42(>\ 

* + i43)' 


= 256^ 


88316 , 12176 

+ 2145 * + 715 

&• + 9.* + f *’ + iff,' + x 4 


X + *J*) 

^ 143/ ’ 


, 71756.3 . 47340.2 , 1026576 . 36288 

i on i X “T nni X “T ln irr X -f“ 


221 


221 


12155 


2431 


) 


L 
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Comme verification de ces expressions, nous observerons que, d’apr£s (44), on a 


n 




Qn- l{~i) 


d’oil l’on d6duit 


(45) 


(?2»+l(— 3) “ 0, 

n ( M • • (2w — l)] 4 

*> L3-5-7* • • (4n — 1) * 


(4n - 1) 

La constante i£ n qui figure dans (41) s’obtient en multipliant les deux membres 
de (44) par x" -1 et en remplagant x symboliquement par b, ce qui donne 

4 


« 


0 = 2 ^ + 4^T Ji: »-‘ 
et comme Ko = b 0 Qo(b ) = 1, on obtient 

(- 1 ) B 1 [ ft !] 4 


(46) 


K n = 


2n + 1 2 B [1 -3-5* - - (2w -I)] 5 ' 

Le coefficient de x n dans Q„(x) £tant 6gal k 2 n , on a les relations d’orthogonalite 
symbol ique t 

(47) Q m (b) Q,(b ) = I® 1 m * 

oil K n a la valeur (46) . 

Si Ton consid&re la s£rie 


12 n K n 


m — n, 


y(z) = Z j, 

n-0 fl\ 

on peut former au moyen de (44), une equation differentielle lineaire du 
troisieme ordre 5. laquelle elle satisfait et montrer qu’elle converge pour 
\z\ < 2. C’est ce qu’on verifie, en faisant x = — k l’aide de (45) et de la 
formule de Stirling. Mais l’expression generate des polyndmes Q n (x) nous 
echappe. 


14. Les formules symboliques (47) donnent naissance k une orthogonalite 
veritable de la mantere suivante. 


II est facile de demon trer, au moyen du calcul des residus et en vertu de 
(43), que 


iK* + 1) 


_L r +U ° _cot rz 

2 t { Je-fco T (z — X) 2 


-1 < c < 0 


le point x etant soit k droite soit k gauche de la droite d’integration d’abscisse 
c. Int^grons par parties, nous obtenons 


• pc+iao 


dz 


sin vz z 


En developpant 1 /(z — x) suivant les puissances de 1/x, sans avoir egard k 
la convergence et en comparant k la serie asymptotique (42), on est conduit 
5 penser que 
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(48) 


_ ii C e+Ux> * dz 

2 J c-i™ sin 2 *-* 


c—ia> Sm 7T2 


-1 < c < 0, 
n = 0, 1, 2, ... . 


Or cette expression des nombres de Bernoulli qui parait etre rest£e inapergue 
ou, en tous cas, peu employee n’est en definitive, qu’une application d’une 
belle formule, 6tablie par Jensen, pour la fonction £(s) de Riemann, et rap- 
pe!4e par Lindelof (6, p. 103), savoir 


(49) fr - 1) Us) = 4rJ"(i + /’)»“-> iL 


Si 1’on tient compte, en effet, que 

6. = (-l)""‘nf(l - «), n- 0,1,2,... 


et que 

(£ 4- ti) n + (| — ti) n = 2(? + / 2 ) in cos (n arctg 2t ) 
la formule (49) donne 

(50) K = UT (r .Y;-.y 

qui se ram£ne imm6diatement k (48). Les deux formules (48) et (50) convien- 
nent Tune et l’autre pour exprimer I’orthogonalite (47) des polyndmes Q n (x). 
En choisissant (48) on aura 

_ W C c+ico Q m (z) Q„(z) ^ = jo, m n, 

2 Jc-ioo sin 4 irz \2 W 2C B , m = n, 

oil — 1 < c < 0 et oil K n a la valeur (46). 
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ON SOME POLYNOMIALS OF TOUCHARD 

MAX WYMAN and LEO MOSER 

In the preceding paper Touchard considers a set of polynomials Q n (x) 
defined by 

(1) <?n+l(*) = (2x + 1) Q n (x) + Qn-l( x )> Q-n(x) = 0, Qo(x) = 1. 

Touchard uses (1) to compute (?„(*) for 0 < n < 9 and also finds Q n {— 2 )- 
He remarks however ‘Texpression g6n£rale des polynomes Q n (x) nousechappe.” 
The object of this note is to derive an explicit expression for Q n (x). 

Under the substitution 



the conditions (1) become 

(3) (n + 1 )W. +l = (2* + 1)(2 n + 1) W. + W'_(*)=0, W,(x) = l. 

Now define the generating function 

(4) W{f) = E w. £ . 

n=0 n\ 

The conditions (3) then imply 

(5) t(t* - 1)^ + (32* + 2(2* + - 1)^ +(t + 2x + 1 )W - 0, 

H^O) = 1. 

Equation (5) is a special case of Heun’s equation. Its solution can be 
obtained in the following way: Let 

(6) W = (1 - o~ (to+1 V 2 = t\ 

Then (5) becomes 

(7) z(z — l)^r 4- {1 — (1 — 2 *)z}^T “ x * w = w(0) ~ !• 

This is the well-known hypergeometric equation. The only solution regular 
at z = 0 and satisfying the boundary condition is 

(8) w = F(-x, — x , 1, +z). 

Hence from (6) we obtain 

(9) W = (1 — t)- (ax+1) F(-x, -x, -1, / 2 ). 
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Since 


w n = 


d n W 

dt n 


<=o 


n\ T(2x+«-2r+l) T*(r-x)(2r)l 

~ X)) 5W W 


(9) implies 

(10) w.«(r(-(2*+i)) r ! ( 

-4:( 2 '. + -" J 7 2r )(;)' 

By (2) and (10) an explicit expression for Q n (x) is 


This of course checks with the values of Q n {x) computed by Touchard for 
0 < n < 9, and also gives his value of (?„(— 2 )* Finally, the expression (11) 
simplifies considerably for x a positive or negative integer and for 2jc a negative 
integer. Thus for example 

(12) G.(0) = 2”b!( 2 ”) ‘ 


and 

(13) (?n(l) = 2"n!(n 2 + n + • 


Equation (13) provides still another simple check on values of Q n {x ) computed 
from the recurrence formula. 


University of Alberta 
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ON THE THEORY OF RING-LOGICS 


ADIL YAQUB 

Introduction. Boolean rings ( B , X, +) and Boolean logics ( = Boolean 
algebras) ( B , C\, *) are equationally interdefinable in a familiar way (6). 
Foster’s theory of ring-logics (I; 2; 3) raises this interdefinability and indeed 
the entire Boolean theory to a more general level. In this theory a ring (or an 
algebra) R is studied modulo K, where K is an arbitrary transformation group 
(or “Coordinate transformations”) in R. The Boolean theory results from the 
special choice, for K, of the “Boolean group,” generated by x* = 1 — x 
(order 2, x** = x ). More generally, in a commutative ring (R, X, +) with 
identity the natural group N, generated by x A = 1 + x (with x y = x — 1 as 
inverse) was shown to be of particular interest. Thus specialized to N, a 
commutative ring with identity ( R , X, +) is called a ring-logic, mod N, if 
(1) the + of the ring is equationally definable in terms of its iV-logic ( R , 
X, A , v ), and (2) the + of the ring is fixed by its iY-logic. It was shown (2) 
that each p- ring (5) is a ring-logic mod N. It was further shown (3) that each 
p k - ring (3; 5) is a ring-logic mod D , where D is a somewhat more involved 
group. 

All these known examples of ring-logics have zero radical, and the question 
presents itself: do there exist examples of ring-logics (modulo a suitable group) 
with non-zero radical? We shall answer this in the affirmative. Indeed, we shall 
show that the ring of residues mod n ( n arbitrary) is a ring-logic modulo 
the natural group N itself. 

1 . The ring of residues mod p k . Let (R, X, -f~) be a commutative ring 
with identity 1. We denote the generator of the natural group N by 

(1.1) x A — 1 + x, 
with inverse 

(1.2) x y = x - 1. 

As in ( 1 ), we define 

(1.3) a Xjb = (a A X & A ) V . 

It is readily verified that 

(1.4) a X A b — a -f* b -f- ab. 

The following notation is used (2): 

* An = ( • • . ((*Y) • • -) A ; * VB = ( . . • ((*T) • . -)\ 

n iterations. Again 

x Atn = (x A *) n ; * vw = (x vk ) n . 

Received September 9, 1955. 
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We now consider ( R pk , X, +), the ring of residues mod p* ( p prime) and 
prove the following 

Theorem 1. (R pk , X, -T) is a ring-logic (mod N). The ring -f- is given by the 
following N -logical formula 

(1.5) x + y= { (x(yx pk ~ pk ~ l ~ 1 ) A )x pk ~ pk ~ 1 } X A 

{ (x A (y (x A )* k -* k - l - l yy tf k -* k - 1 y 2 } . 

Proof. By Euler’s generalized form of Fermat’s Theorem, we have 

(1.6) a pk ~ pk ~ 1 = 1, a e R pk , 

a not divisible by p. We now distinguish two cases: 

Case 1: Suppose p does not divide x. Then, by (1.6), the right side of (1.5) 
reduces to 

{*(1 + yp >k ~ pk ~ l ~' L ) • 1 } X A 0 = x + y X pk ~ pk ~ 1 = x + y, 

since 

. 0; o X A 0 = a. 

This proves (1.5). 

Case 2: Suppose p divides x. Then, clearly, p does not divide x* = 1 + x. 
Hence, using Case 1, the right side of (1.5) reduces to 

0 X a {(* a (1 +y(x A ) pk - pk ' l - 1 )) v -lj = (* A + y(x A ) pk - pk -'y 

= ( x A + y) v = * + y, 

since 

(*>*-p‘-i) V2 = 0 vS =l;0X A fl = a. 

Again (1.5) is verified. Hence (Rpk, X, +) is equationally definable in terms of 
its N- logic. Next, we show that ( R pk , X, +) is fixed by its iV-logic. 1 Suppose 
then that there exists another ring ( R v t , X, +')> with the same class of 
elements Rpk and the same X as ( R pk , X, +) and which has the same logic 
as (R pk , X, +)• To prove that + = +'. Again we distinguish two cases. 

Case 1: p does not divide x. Then 

x +'y = x(l +'y X * k - 1>k - 1 - 1 ) = x(yx pk - pk ~ 1 -') A = x(l + yx? k - pk ~ x - 1 ) = * + y, 
since, by hypothesis, x* = 1 + # = 1 +'*. 


*A ring ( R , X, +) is said to be fixed by its iV-logic if there exists no other ring (R, X, +')» 
on the same set R and with the same X but with +' J* -K which has the same N- logic; 
i.e., 

** * 1 + * « 1 +'x ; x v — x — 1 — x -'1. 
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Case 2: p divides x. Then, clearly, p does not divide x A = 1 + *. Hence, by 
Case 1, 

x +'y — x A -\-’y y = x A + y v = x + y. 

Therefore +'= + , and the theorem is proved. 

Corollary. ( R p , X, -f) = (F„, X, +), the ring {field) of residues (mod p), 
p prime , is a ring-logic (mod N) the + being given by setting k = 1 in (1.5), 
and making use of x p = x: 

(1.7)* x + y={ (x(x>-*y) A )} X A { (^((ac A ) p - 2 y) A ) v (^- 1 ) v2 }- 

2. The ring of residues (mod «), n arbitrary. In attempting to 
generalize Theorem 1 to the residue class ring ( R n , X, +)> where n is any 
positive integer, the following concept of independence, introduced by Foster 
(4), is needed. 

Definition. Let 21 = {2L, 2L. • • . . 2l»} be a finite set of algebras of the same 
species <3. We say that the algebras 2li, 2L, . . • , 2l» are independent if, corres- 
ponding to each set {<£,} of expressions of species © {i = 1, . . . , »), there 
exists at least one expression X such that 

<t>i = X (mod 2l<) (i — 1, . . . , n). 

By an expression we mean some composition of one or more indeterminate- 
symbols f, ... in terms of the primitive operations of 2li, 5L, • • • , 5f«; <t> — X 
(mod 21), also written as <f> = ^(2f), means that this is an identity of the 
algebra 21. 

We now prove the following 

Theorem 2. Let (21 1 , X, +),... , (21 1 , X, +) be a finite set of ring-logics 
(mod N), such that the N-logics (21 1 , X, A ), . . . , (21 1 , X, A ) are independent. 
Then 21 = 2ti X ... X2I : {direct product ) is also a ring-logic (mod N). 

Proof. Since 21 < is a ring-logic (mod N), there exists an iV-logical expression 
<j>i such that, for every x u y< € 2l< (i = 1, . . . , t), 

Xi + y< = <t>i = <f>i{xi, yu X, A , v ) = 4 >i(x h y t \ X, A ). 

In view of the independence of the logics, there exists an expression X such 
that 

(<tn (mod 21 0, 

x = L . . 

U* (mod 21,). 

Then, for a = {a u a 2 , . . . , a t ) € 21; b = {b ly b 2 , . . . , b , ) 6 21, we have 


*This formula is considerably shorter than the formulas for + given in (2; 3). 
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i.e. f 


X (a, Xi ) X ((dii a 2 , • • • i d|), (bi, b 2 , • • • » &i) 5 X i A ) 

= (Z(di, &i; X, A ),X(a 2 ,b 2 ‘, X, A ), . . . , X{a„ b t \ X, A )) 

= (dl + ^1, d2 + ^2i • • • i d t + b t ) 

— (di, a 2 , . . . , a t ) + (b i, 62 , • ♦ • , b t ) 

— a + b; 

a + b = X (a, b\ X, A )\ a, b £ SI. 


Hence, St = StiX . . . XSt t is equationally definable in terms of its N-logic. 
Next, we show that St is fixed by its i\Mogic. Suppose there exists a + / such 
that (St, X, 4*0 is a ring, with the same class of elements St and the same 
X as the ring (St, X, +), and which has the same logic (31, X, A ) as the ring 
(St, X, +)• To prove that + = +'• 

Now, let a = (d 1, d2, . . . , a t ) £ St; b = (bi, b 2 , . . . , b t ) £ St. A new 4 * / 
in St defines and is defined by new 4 - , i in Sti, +'2 in Stj, . . . , in St t , such 
that (Sti, X, + / i) is a ring, and similarly for (St 2 , X, +'*), .... (St„ X, +'i); 
i.e., 

^2 ® 4 * b — (di, d2, . • • 1 a 1) 4 * (61, b 2 , * • • , b t) 

— (di 4-'i&i, d 2 ~\~ r zb 2 , . . . ,a t 4 ~'tb t ). 

Furthermore, the assumption that (St, X, 4-') has the same logic as (31, X, 4-) 
is equivalent to the assumption that (31 1 , X, -F'i) has the same logic as 
(3ti, X, 4-), and similarly for (St f , X, 4-'i) and (Sti, X, 4~) {i = 2 
Since (Sti, X, 4*) is a ring-logic, and hence with its 4- fixed, it follows that 
4"'i = 4-; similarly +' 2 = -f, . . . , 4 t = +• Hence, using (2.1), 4-' = 4-, 
and the proof is complete. 

We shall now prove the following 


Lemma 3. Let pi, . . . , p , be distinct primes, and let 

{P-m , X, 4-), w i = p i — p tm i\ i — 1 , • . • , t t 

be a set of residue class rings (mod n t ). Then the logics ( R ni , X , A ) (i = 1, . . . , t) 
are independent. 

Proof. Let 

P(i) = Tl n J> J 5* *» 

j - 1 


Then, clearly 


(P(f),n t ) = 1. 

Hence, there exist integers r t > 0, s t > 0 such that 


P{i ) — Sint = 1. 


Now, define 


e(x) = x 


(ni— i»i)(n2— m2)... (n«— mi) 
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Then one easily verifies that, for i 9 * j, 

<*, = «,(*) = {«(*) X A ((e(ac ) 1 _ jUjW 

Now, to prove the independence of the logics ( R ni , X, A ), let {0 t } be a set of t 
expressions of species X, A ; i.e., a primitive composition of indeterminate- 
symbols in terms of the operations X, A ; then, if we define (cf. 4) 

X — <jnwi X A <fau2 X A . . . X A ^ t o) t , 

we immediately obtain 

<f>i = X(mod Rm), 

since a X A 0 = a = 0 X A a. This proves the theorem. 

Recalling the well-known fact that 

(2.2) ( R n , X, +) = R ni X . . . X R nt (direct product), 

n arbitrary, n = nj . . . n t , a combination of Theorems 1, 2, Lemma 3 and 

(2.2) readily yields 

Theorem 4 (Fundamental Theorem on R n as ring-logics). ( R n , X, +)» 
the residue class ring (mod n), n arbitrary , is a ring-logic (mod N). 

We conclude with several illustrative examples. 

Example 1. R v k = R 2 = F 2 = {0, 1}. 

It is readily verified that each of (1.5) and (1.7) reduces to the familiar Boolean 
formula 

(2.3) x + y - xy A X A x A y. 

Example 2. R p k = R 3 — F 3 — {0, 1, 2}. 

Formula (1.7) yields 

(2.4) x + y = {*0ry) A } X A {[(x A (x A y) A )] v (a: 2 ) y2 }- 
Compare with (1) in which the following formula was obtained: 

(2.5) x + y = xy^X^yX^y 2 . 

It is noteworthy to observe that the + of ( F pi X, +), the field of residues 
(mod p), p prime, may also be expressed in the following form: 

(2.6) * + y = {x(y*r 2 ) A } X A [y{x A x Ai . . . x Ap " , ) s }. 
or by 

(2.7) * + y = {ac(yx p-2 ) A } XjyOc’-T*}. 

The last formula, when specialized to F 3 , gives a simpler expression for + 
than (2.4). 
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Example 3. R v k = R22 = {0, 1, 2, 3}. 

Formula (1.5) reduces to 

(2.8) x + y = { (x(xyYx 2 ) } X J [(x A (x A y) A )] v (x 2 ) v2 }« 

It may be verified that the + in ( R 4, X, +) is also given by 

(2.9) x + y = { (xy) A (xy ) 2 X^xX^K*?) 42 } { foOfry)**} 4 . 

This last formula excels most of the others in obviously displaying the 
symmetry of -f-. 

Example 4. R n = Re = {0, 1, 2, 3, 4, 5}. 

The correspondence 

0 (O2, O3), 3 «-* (I2, O3), 

1 (I2, 13)1 4 <-» (O2, 13). 

2 <-> (O2, 23), 5 <-» (I2, 23), 

determines an isomorphism of Rz and R2 X R3 (direct product), where R 2 = 
{O2. 1 2 } and i?3 = {O3, 1 3) 23}. 

It is readily verified (compare with the proof of Lemma 3 and (2.3), (2.5) 
above) that 

(2.10) x + y = { (x/X A x A y)(x 2 X A (x 2 ) v2 ) As | 

X A { (xy 4 X A a: A y X a xV)(x 2 X a (x 2 ) v2 ) a, J- 

Formula (2.10) may be verified either by direct substitution from Rt, or 
via the R2 X Rz representation above. 
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MASCHKE MODULES OVER DEDEKIND RINGS 


IRVING REINER 

1. Introduction. We use the following notation throughout: 

o = Dedekind ring (8; 12, p. 83). 

K = quotient field of o. 

A — finite-dimensional separable algebra over K, with identity element 
e (6, p. 115). 

G = o-order in A (2, p. 69). 
p = prime ideal in o. 

K „ = p-adic completion of K. 

Op = p-adic integers in ATp. 

p* = irOp = unique prime ideal in o p . 

K = o/p = Op/p* = residue class field. 

By a G-ntodule we shall mean a left G-module R satisfying 

1. R is a finitely generated torsion-free left o-module. 

2. For x, y 6 G, r, s 6 R: 

(xy)r = x(yr), ( x + y)r = xr -f- yr, x(r + s) = xr + er = r. 

Following Gaschiitz and Ikeda (3; 5; see also 7; 10) we call a G-module R 
an M u -G-module (unterer Maschke Modul) if, whenever R is an o-direct 
summand of a G-module S, R is a G-direct summand of S. Likewise, R is an 
M^-G-tnodule (oberer Maschke Modul) if, whenever S/Ri is G-isomorphic to 
R where the G-module 5 contains the G-module Ri as o-direct summand, 
Ri is a G-direct summand of S. 

If all modules considered happen to have o-bases (for example, when o 
is a principal ideal ring), then we may interpret these concepts in terms of 
matrix representations over o. Thus, a representation T of G in o is an M 0 - 
representation if for every reduced representation 

( r A ) 

\0 A/ 

of G in o, the binding system A is strongly-equivalent (13) to zero, that is, 
there exists a matrix T (over o) such that 

A (x) = r (x) T — TA (x) for all x 6 G. 

(Likewise we may define an Af u -representation of G in o.) 

Received September 19, 1955; in revised fo. m December 14, 1955. This work was supported 
in part by a contract with the National Science Foundation. The author wishes to thank 
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329 


330 


IRVING REINER 


Starting with a prime ideal p of o, we may form G = G/pG, an algebra 
over K. If R is a G-module, then R = R/pR can be made into a G-module in 
obvious fashion, and R is then a vector space over K. The main results of this 
note are as follows: 

Theorem 1. If for each p, R is an M u -G-module (or M 0 - G-module ) , then R 
is an M u -G-module (or M q-G - module) . 

Theorem 2. If G is a Frohenius algebra over o, and R is an M u -G-module 
(or M q-G - module) , then for each p, R is an M u -G-module (or M^-G -module). 

The significance of Theorem 1 is that it reduces the problem of deciding 
whether an o-module R is an M tt -G-module to that of determining for each p 
whether the vector space R over K is an M^-G-module. Thus, we pass from a 
ring problem to afield problem, which is in general much simpler. 

In the important case where G = o (H) is the group ring of a finite group H, 
then G is semi-simple whenever p does not divide the order of H, and for 
such p the module R is automatically an .M-G-moduIe. More generally, we 
may form the ideal 1(G) of G defined by Higman (4); his results show that 
1(G) 9^ 0 in this case. From (9) we deduce at once that G is semi-simple 
whenever p does not divide 1(G). Therefore: 

Corollary 1. R is an M u -G-module (or M 0 -G-module) if for each p dividing 
1(G), R is an M u -G-module (or M 0 - G-module). (Note that only finitely many 
p’s are involved.) 

Now let G be a Frobenius algebra over o, for example, G = o(II). Then by 
(5) there is no distinction between Mq- and Af„-modules, and Theorems 1 and 
2 tell us that R is an Af-G-module if and only if for each p, R is an ilf-G-module. 
Using the concept of genus introduced by Maranda in (9), we have: 

Corollary 2. Let G be a Frobenius algebra over o, and let R, S be G-modules 
in the same genus. Then R is an M -G-module if and only if S is an M -G-module. 

2. p-adic completion. Theorem 1 will follow at once from two lemmas, 
of which we prove the more difficult first. Let R be a G-module, and define 

Gp — G <S> Op, Rp = Op <g> R, 
both products being taken over o. 

Lemma 1. If for each p, Rp is an M u -Gp-module (or M Q-Gp-module) , then R 
is an M u -G-module (or M q- G- module ) . 

Proof. (We give the proof only for M u -modules.) Let R be an o-direct 
summand of a G-module S. We wish to show that R is a G-direct summand of 
S, that is, that there exists / 6 Hom c (G, R) such that/|i? = identity. Using 
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the Steinitz-Chevalley theory (1; 11) of the structure of finitely gener- 
ated torsion-free modules over Dedekind rings, and taking into account 
the hypothesis that R is an o-direct summand of S, we may write 

5 = 2IiSi © ... © 2I n S n j R = SllSl © ... © 2f m S m i 

with m < n, where each 21 < is an o-ideal in K , and where Si, . . . , s n are linearly 
independent over K. For the remainder of this proof, let the index i range from 
1 tow, and j from 1 to m. 

To prove the lemma, it suffices to exhibit / £ Hom i4 (iC5, KR) such that 
f\KR = identity, and / maps 5 into R. (We use KS to denote the iC-module 
generated by S .) Let us set 

(1) f(Si) = 'Ea ij Sj, a tj e K, 

thereby defining / £ Homx(iC5, KR). Then / maps 5 into R if and only if 
for each a € 21 < we have aa i} € 21 ,, that is, if and only if 

(2) a i} 6 (21,: 210 for all i, j. 

On the other hand, the map / defined by (1) will be an A -homomorphism 
with f\KR = identity, if and only if for all x 6 G, s 6 S, r € R: 

f(xs) = xf(s), f(r) = r. 

Let us set 

G = o*i + . . . + ox t . 

This is possible since (2, p. 70) G is a finitely generated o-module. Then / is 
an A -homomorphism with/|Xi? = identity, if and only if 

(3) f(x k Si) = x k f(Si), f(sj) = Sj for all i,j, k, 

where the index k ranges from 1 to t. Equations (3) are a set of linear equations 
with coefficients in K, to be solved for unknowns {a,,} satisfying (2). 

From the hypotheses of the lemma we deduce that for each p, (3) has 
a solution {a<,} satisfying a i} € (21,: 21 do* for all i, j. Thus (3) is solvable over 
the extension field Kp of K, and hence is also solvable over K. The general 
solution of (3) over K is given by 

(4) a i} = tij/di j, e i} = e t ,(t) = &<, -+- ^2 c\]t vy 

where the b i} , c'/J , d tj are fixed elements of o, d i} 0, and where t ranges over 
all A^-tuples in K N . The general solution of (3) over Kp is also given by (4) 
by letting t range over Kp N . Then for each p, we can find t(p) for which 

(5) eij(t( p)) £ iQijOp for allf.j, 

where 93i,= (21,: 2f t )d,,. 
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For each p, let b( p) be the maximal exponent to which p occurs in the prime 
ideal factorizations of the ideals 33^. Then 6(p) =0 except for a finite set of 
primes. Set P = { p: & (p) > 0}, and choose an iV-tuple t with components in o 
such that (componentwise) 

t = t(p) (mod p 6(p) ) for each p € P. 

In that case, <?^(t) ss e</t(p)) (mod p &a>) ) for each p € P, and all i,j, whence 
by (5) we have 

(6) ordpc^t) > ord„93 tJ for all i,j, 

for all p ( P. But for p ([ P, equation (6) is certainly valid because e^(t) 6 o, 
and ordpSB i} < 0. Hence we deduce that e i} ( t) 6 33^ = (2t>: 21 i)d tj for all i,j, 
whence (4) gives a solution of (3) for which (2) holds. 

We may remark that this lemma is almost trivial when o is a principal 
ideal ring. 

3. Modular representations. Now let R p be a Gp-module, and define 
R p = R p /irR p , G p = Gt/nG). To complete the proof of Theorem 1, we need 
only show: 

Lemma 2. If R p is an M u -G r module (or M 0 -G 9 -module) t then R p is an M u -G 9 - 
module (or Mq-G ^- module). 

Proof. Since o p is a principal ideal ring, we may express the proof (given 
here only for Af 0 -modules) in terms of matrix representations. We must show 
that if F is a representation of G p in o p for which P (the induced modular 
representation of G p in K) is an Af 0 -representation, then in any reduced 
representation 

(s t) 

of G p in Op, the binding system A is strongly-equivalent to zero. 

We may write G p = o p yi © ... © 0 py„, G p = Kyi © ... © Ky n . We shall 
show the existence of a matrix T over o p such that 

(8) A(y t ) = T(y { )T - TA(y,) for each *, 

where in this proof the index i ranges from 1 to n. By taking residue classes 
mod p*, the representation (7) gives a representation 

e a 

of Gp in K. Since T is by hypothesis an Af 0 -representation, the binding system 
X is strongly-equivalent to zero over K. Therefore there exists V\ over o p 
such that 

(9) A (y t ) = r(y,)Fi — FiA(y t ) + ttA (1) (y<) for each i, 
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where A (1) is also over Op. But then (7) with A replaced by A (1) gives another 
Op-representation of Gp, whence the same argument shows 

A <»(*) = T(y t ) V 2 - VMy t ) + ir A< 2 >(y,) for all i, 

where V 2 and A (2) are over Op. Continuing in this way, we obtain a solution of 
(8) given by T = V\ + ttV 2 + t 2 V 3 4* • • • • 

This proof could also have been stated in terms of cohomology groups. 

4. Frobenius algebra. Suppose in this section that G is a Frobenius 
algebra over o, that is, there exist o-bases {#<}, {*>*} of G (called dual bases) 
such that the right regular representation of G with respect to {t; < } coincides 
with the left regular representation with respect to {«<}. Assume that G has 
an o-basis containing e. Ikeda showed (5) that M 0 - and Af u -moduIes were the 
same, and that a G-module R is an Af-G-module if and only if there exists an 
o-endomorphism <t> of R such that 

(10) £ Ui<t>Vi = identity endomorphism of R. 

Gaschiitz (3) had shown this for the case where G = o (H), H — finite group, 
with (10) replaced by: 

(11) £ h<t>h~ l — identity endomorphism of R. 

htB 

We may use Ikeda’s result to obtain an immediate proof of Theorem 2. 
By hypothesis, R is an M-G-module, whence (10) holds for some o-endomor- 
phism <f>. But then clearly <f> induces a iGendomorphism <£ of R , and 
J2 u i < t> v i — identity endomorphism of R, so that R is an Af-G-module. 
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STANDARD AND ACCESSIBLE RINGS 

ERWIN KLEINFELD 

1. Introduction. A ring is defined to be standard (1) in case the following 
two identities hold: 

(1) (wx, y, z) + ( xz , y , w) ■+■ (wz, y, x) = 0, 

(2) (x, y, z) + ( 2 , x, y) - (x, z, y) = 0, 

where the associator (x, y, z) is defined by (x, y, z) = ( xy)z — x(yz). Albert 
has determined the structure of finite-dimensional, standard algebras (I). 
The simple ones turn out to be either Jordan algebras or associative ones. 

We focus attention here on a more general class of rings, which we shall 
call accessible. By permuting w and x in (1) and subtracting from (1) we 
obtain the identity 

(3) ((w, x), y, z) = 0, 

where the commutator ( w , x) is defined by ( w , x) — wx — xw. A ring is called 
accessible in case identities (2) and (3) hold. Thus a standard ring is auto- 
matically accessible. On the other hand, while (2) and (3) hold in any commu- 
tative ring, (1) need not. 

The structure of accessible rings, without finiteness assumptions, can readily 
be determined. An accessible ring is defined to be simple in case it has no 
proper two-sided ideals. Simple, accessible rings are either associative or 
commutative. From this result it follows trivially that simple, standard rings 
of characteristic different from 3 are either Jordan or associative rings. A 
structure for semi-simple, accessible rings is given, utilizing the Jacobson- 
Brown radical and the fact that primitive, accessible rings are either associative 
or commutative. 

The following result may also be of interest. If an accessible ring has no 
nilpotent ideals other than zero, then it is isomorphic to a subdirect sum of an 
associative and a commutative ring. Hence all identities common to the class 
of rings, which consist of all associative rings and all commutative rings, must 
hold in such a ring. 

The methods of proof are quite elementary. Identities are obtained which 
enable the construction of certain significant ideals. 

2. Preliminaries. Substituting z — y in (2) one obtains the flexible law, 
(y, x, y) = 0. A linearization of this identity yields (y, x, z) — — ( 2 , x, y). 

Received September 26, 1955. This research was supported in part by a grant from the 
National Science Foundation to the University of Connecticut. 
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As Albert observed, it can now be seen that (2) is equivalent to the flexible 
law and the identity 

(x, y, z) + (y, z, x) + ( 2 , x, y) = 0. 

We note that similarly (1) is equivalent to (3) and the identity 

(wx, y, z ) + (xz, y, w ) + ( zw , y, x) = 0. 

In an arbitrary ring the identity 

(xy, 2 ) = x(y, 2 ) + (x, z)y + (x, y, 2 ) + ( 2 , x, y ) - (x, 2 , y) 

holds. Thus (2) is a consequence of the commutative law, as well as of the 
associative law. Moreover the identity 

(4) (xy, 2 ) = x(y, 2 ) + (*, z)y, 

holds in every accessible ring. 

Another identity which holds in an arbitrary ring is 

(5) (wx, y, 2 ) - (w, xy, z) + (w, x, yz) = w(x, y, 2 ) + (w, x, y)z. 

The nucleus of an accessible ring is defined as the set of all elements n in R 
with the property (w, R, R) =0. If n is an element of the nucleus N of R, 
then because of the flexible law (R, R, n) = 0. Finally, because of (2), it 
follows that also (R, n, R) — 0. If n is substituted for w in (5), it becomes 
obvious that 

(6) (wx, y, 2 ) = w(x, y, 2 ), n 6 N. 

The center C of R is defined as the set of all elements c in N which have the 
additional property that (c, R) = 0. 

We now proceed to develop further identities that hold in arbitrary acces- 
sible rings. The elements u, v, w, x, y, z will denote arbitrary elements of such 
rings. 

Through repeated use of (4) one may break up ((w, x, y), 2 ) as 

((• w , x, y), z) = (wx-y — w-xy, 2 ) = wx • (y, 2 ) + w(x, z)-y 

+ (w, z)x-y — (w, 2 ) -xy — w-x(y, z) — w • (x, z)y 
= (w, x, (y, 2 )) + (w, (x, 2 ), y) + ((w, 2 ), x, y). 

Since (3) implies that every commutator is in the nucleus, we obtain 

(7) ((w, x, y),z) = 0. 

Because of (6) and the fact that every commutator is in the nucleus we get 
(v, x) (x, y, 2 ) = ((v, x)x, y, z). It follows from (4) that (v, x)x = (vx,x). 
Consequently 

((v, x)x, y, 2 ) = ((vx, x), y, z) = 0. 
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Therefore (*>, x ) (x, y, z) = 0. 1 A linearization of this last identity becomes 

(8) (v,w)(x,y,z) = - (v,x)(w,y,z). 

One can now prove that a product of a commutator with an associator 
always lies in the center. First one notes that 

((i>, w) ( x , y, z ), u) = ((v, w ), u ) (x, y, z ), 

because of (4) and (7). From the definition of the commutator it follows that 

((v, w),u)(x, y, z) = - (u, (v, w))(x, y, z). 

It is this last form to which we apply (8) to obtain 

- («, ( v , w))(x, y, z) = (w, x)((v, w), y , z). 

Finally (3) tells us that ((z>, w), y, z) = 0, so that (u, x)((v, w), y, z) — 0. 
Consequently ( (v, w) (jc, y , z) , u) =0. It remains only to prove that 
(z/, w)(x, y, z ) lies in the nucleus. It is easily seen that 

((». w)(x, y . 2). u) = (v, w)((x, y, z), t, u), 

using (6) and (3). At this point (8) is employed to yield 

(v, w)((x, y, z), t, u) = - (v,(x, y, z) )(w,t, u). 

But (v,(x, y, z)) =0 was proven with (7). Consequently ((r, w)(x, y, z ), t, u ) =0. 
We have established that 

(9) ( v , w) ( x , y, z) £ C. 

Now let us consider the element [(t>, w){x, y, z)] 2 . Clearly 

[( v , w) (x, y,z)] 2 = ( v , w) (x, y, z) ( v , w) ( x , y, z) = - (t>, x) ( w , y, z) ( v , w) ( x , y, z), 

using (3) and (8). On the other hand ( w , y, z)(v, w) = ( v , w)(w, y, z), because 
of (7). But we have already noted that (v, w) (w, y, z) = 0. Thus we have 
proved that 

(10) [(w, w)(x, y, z)] 2 = 0. 

3. Structure theory. Decently behaved rings have no nilpotent elements 
in their center. For let R be any ring with nilpotent elements in its center. 
Then there must be an element c^0 and in the center of R such that c 2 = 0. 
Consider the ideal D generated by c. It consists of all elements of the form 
ic + cx, where i is any integer and x an arbitrary element of R. It is now easy 
to verify that D 2 = 0, and so R has a non-zero, nilpotent ideal. 

Henceforth we shall be considering accessible rings R without nilpotent 


‘Independently R. L. San Soucie has announced in Abstract 672, Bull. A. M. S. 61 (1955) 
that rings satisfying (3), which have no divisors of zero, are either associative or commutative. 
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elements in their centers, unless otherwise noted. An immediate consequence 
of this assumption, taking into account (9) and (10), is that 

(11) (», w) (x, y, z) = 0. 

But then one can also obtain from (4) and (11) that 

(»(*. y, z ) , w) = ( v , w)(x, y, z ) = 0. 

Also 

((», w)x, y, z) = (i>, w ) ( x , y, z) = 0, 

because of (6) and (11). This last information allows us to construct ideals 
A and B in R, which have rather interesting properties. Let A consist of 
all finite sums of elements of the form (x, y , z ) or of the form w(x, y, z ). This 
set A, as may be readily verified, is an ideal even in an arbitrary ring. It is the 
smallest ideal modulo which the ring is associative. With the present assump- 
tions, namely that R is accessible and has no nilpotent elements in its center, 
we can assert that for any element a in A we have (a, R) = 0. 

Let B consist of all finite sums of elements of the form (x, y) or of the form 
(x, y)z. In an arbitrary ring this set need not be an ideal, but by virtue of (3) 
and (4) it can be shown to be one. In addition it is also true that B is con- 
tained in the nucleus N. B is also the smallest ideal modulo which R is com- 
mutative. 

From previous remarks, in conjunction with (7) and (11), it becomes clear 
that for any element a in A and any element b in B we must have ab = 0. 
Therefore AB = 0. Suppose that x is an element of A C\ B. Then since AB = 0, 
X 2 = 0. But x lies in the center of R because of the previously mentioned 
properties of A and B. Hence x = 0. 

At this point several theorems may be established. 

Theorem 1. A simple , accessible ring R is either associative or commutative. 

Proof. If R has nilpotent elements in its center then the ideal D described 
previously is different from zero, so that D = R. Since D 2 = 0, R must be a 
trivial ring, which is both associative and commutative. The only remaining 
case is the one in which R has no nilpotent elements in its center. Then the 
ideal B constructed above is either zero or the whole ring. If B = 0 then R is 
commutative, while if B = R then R is associative, since B is contained in the 
nucleus. This completes the proof. 

By substituting w = x and z = x in (1), one obtains 3(x 2 , y, x) = 0. 
Consequently, in a ring in which 3a = 0 implies a = 0 and which satisfies 
the identity (1), the Jordan identity (x 2 , y, x) = 0 must hold. Therefore a 
commutative, standard ring of characteristic not 3 is automatically a Jordan 
ring. It follows as an immediate Corollary to Theorem 1 that a simple, standard 
ring of characteristic not 3 is either a Jordan ring or associative. This is a 
generalization to rings of the theorem of Albert’s mentioned in the introduction. 
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THEOREM 2. If an accessible ring R has no nilpotent ideal other than zero , 
then it is isomorphic to a subdirect sum of an associative and a commutative 
ring. 

Proof. By assumption R can have no nilpotent ideal other than zero, so 
that D = 0. Hence R has no nilpotent elements in its center. Consider the 
natural homomorphism from R into R/A © R/B. The kernel of this homo- 
morphism is A r\ B — 0. Hence R is a subdirect sum of R/A and R/B. 
We have already noted that R/A is associative and that R/B is commutative. 
This completes the proof of the theorem. 

The following is a direct consequence of Theorem 2. If an accessible ring R 
has a maximal nilpotent ideal I then R/I satisfies the conclusion of Theorem 
2. Any expression involving elements of R, which would be automatically zero 
if the elements came from either an associative or a commutative ring, there- 
fore must generate a nilpotent ideal of R. Of course the definition of acces- 
sibility requires only that two expressions, namely those occurring in (2) and 
(3), be zero. 

The last result is concerned with a conventional type of decomposition, 
the introduction of a radical. Since the class of accessible rings includes the 
associative ones, the maximal nilpotent ideal will in general prove an unsatis- 
factory radical. We turn to a larger radical, namely the generalization of the 
Jacobson radical suggested by Brown (2). From this paper it follows that an 
accessible ring is semi-simple if and only if it is isomorphic to a subdirect 
sum of primitive accessible rings. A ring is defined as primitive in case it 
possesses a regular maximal right ideal F, which contains no two-sided ideal 
of the ring other than the zero ideal. 

We assert 

Theorem 3. A semi-simple, accessible ring is a subidrect sum of primitive 
accessible rings. A primitive, accessible ring is either commutative or associative , 

Proof. Only the second statement remains to be proved. Let R be a primi- 
tive, accessible ring and F a regular maximal right ideal of R which contains 
no two-sided ideal of R other than the zero ideal. The first step will be proving 
that R is prime. That is to say, if G and II are ideals of R such that GH = 0 
and G 0, then H = 0. We note that G F, so that R = F -f G. Then 

RII = (F + G)H = FII + GH = FH C F. 

For arbitrary elements x and y in R and h in H we have 

( x,y,h ) = xy -h — x -yh = xy -h — xh' £ RII. 

But (x, y, h) = — ( h , y, x), because of the flexible law, so that ( h , y, x) 6 RH. 
Finally, by means of (2), it can be shown that (y, h, x) € RH. At this point 
it is easy to see that RH is an ideal of R. Since RH C F, then in fact RH = 0. 
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The regularity of F assures the existence of an element/ in R with the property 
that for all x in R, fx — x is always in F. Then in particular fh — h — — h 
is an element of F. Consequently H C F. Since H is an ideal, H = 0. Since 
a prime ring has no nilpotent ideals other than the zero ideal it has no nilpotent 
elements in its center. As previously shown this implies that the ideals A, B 
of R have the property AB — 0. Hence either A = 0, in which case R is 
associative or B — 0, in which case R is commutative. This completes the 
proof. 
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1. Introduction. In a well-known paper (7) Jacobson has shown how his 
structure theory for arbitrary rings can be applied to give more precise infor- 
mation about the so-called “algebraic” algebras. This specialization of his 
general theory is, however, perhaps not completely satisfying in that it deals 
only with algebras, i.e. rings admitting a field of operators, whereas neither the 
general structure theory nor the definition of the property of being “algebraic” 
seems to depend in any essential way on the precise nature of the operators. 

In this paper we first show (§2) how, by suitably extending the algebraic 
concept to rings with arbitrary operators, Jacobson’s theory of algebraic 
algebras can be carried over without difficulty to all “algebraic rings.” Our 
definition of the algebraic property for arbitrary rings seems a natural one 
(and indeed almost inevitable if the link with x-regularity is to be preserved), 
and in §3 we establish some general results connected with this definition. 
The first of these is unspectacular, and in any case applies only to algebras; 
it serves chiefly as a lemma for a theorem proved later (§5). However, the 
second result, whose hypothesis actually excludes fields as operators, is more 
surprising, having the corollary that every ring algebraic over the integers and of 
zero characteristic must in fact be nil ; thus the algebraic property, as defined 
here with respect to arbitrary operator domains, can, for some choices of the 
operators, and in contrast with its more usual role of “weak finite-dimen- 
sionality,” be a very strong one. 

In the remaining sections we investigate various related questions. Thus 
in §4 we generalize the familiar result that a finite-dimensional matrix algebra 
over an algebraically closed field must be commutative whenever every matrix 
in the algebra can be reduced to diagonal form by a similarity transformation 
(allowed in the first instance to depend on the matrix); our generalization 
(which is applicable to algebras over any field, and indeed to arbitrary rings) 
has a certain topical interest in view of some recent work of Motzkin and 
Taussky. One of the new results in §5, while again referring only to algebras, 
generalizes Jacobson’s result that every algebraic algebra without non-zero 
nilpotent elements, over a finite field, is necessarily commutative; we show 
in particular that the conclusion remains true even if non-zero nilpotent 
elements exist, provided these are all central. The earlier results of §5 are of a 
rather curious and apparently superficial type, but do nevertheless have some 
unexpected implications (e.g. that, if a 7r-regular, or in particular algebraic, 
ring R has all its nilpotent elements central, then the same is true of every 
homomorphic image of R ). 
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We recall Herstein’s result (5) that if, to each element x of a given ring R, 
there corresponds a polynomial p x {\) with integral coefficients ( and possibly a 
constant term) such that x — x *p x (x) lies in the centre of R, then R must be 
commutative. We shall refer to this as H er stein' s theorem, and apply it in §5 
and §6, where we show how certain analogous results, and a few special cases 
of a related conjecture of Herstein, can be deduced from our earlier work. 

2. Preliminaries. Throughout, R will denote any associative ring, not 
necessarily commutative or containing a unit element, admitting an arbitrary 
commutative ring F of operators (i.e. endomorphisms a of the additive group 
of R, subject to a(xy ) = ( ax)y = x(ay) for all x, y £ R); we may suppose 
without loss of generality that F contains the identity operator. The case of a 
“ring without operators” is included in this scheme on taking F to be just the 
ring of integers (or an appropriate quotient ring). When we refer to subrings 
(etc.) of R these should always be understood as sub-F-rings (etc.), i.e. as 
being mapped into themselves by every operator in F. 

If one seeks to introduce an analogue, at this level of generality, of the 
property of an algebra of being “algebraic over its field of operators,” one may 
(cf. 3) think first of calling an element x of R algebraic over F if a positive 
integer n and elements a\, . . . ,a n of F , not all zero, exist satisfying 

(1) a\X -f- cnx 2 + . . . + cunX" = 0. 

This of course reduces to Jacobson’s definition when F is a field. However, 
this form is unsatisfactory from many points of view, as will become clearer 
below; we note for the present that it would not even enable us to carry over 
to rings the well-known property of algebraic algebras of having nil Jacobson 
radical. We therefore adopt a more stringent defining condition: we shall 
now call x algebraic ( over the ring F of operators ) whenever au, . . . , a„ exist as 
above but with the further property that the first non -vanishing a t is the 
identity operator, i.e. only if x satisfies an equation of the (“lower monic”) 
form 

(2) *“ + + . . . + a,x" = 0; 

if R happens to be an algebra, i.e. if F is a field, then this can of course always 
be arranged (on multiplying through by the inverse of the lowest non-zero 
coefficient) whenever x satisfies the formally weaker condition (1). Another 
equivalent form of our new definition is the following: x is algebraic if we can 
find a positive integer m = m(x), and an element a = a{x) of the subring 
generated by x, such that x m = x m a. We call R itself algebraic over F if each 
x £ R is algebraic over F. 

It is a straightforward matter to check that all the principal arguments and 
results of Jacobson’s paper (7) on algebraic algebras are valid, with only 
slight verbal changes, for the wider class of algebraic rings; we omit the details. 

It is important to bring out into the open a point which might otherwise 
give rise to misunderstandings later. Given a ring R over F, we may regard 
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any homomorphic image R* — R/T of R as again a ring over F by defining 
ocx* — (ax)* in the usual way. However, if we agree to regard two operators 
as equal relative to a given ring (which admits them both) if and only if they 
have the same effects on each element of the ring, then the operator set on R * is, 
strictly, not F but the factor ring F* — F/G, where G denotes the ideal of F 
consisting of all a 6 F such that aR < T. This distinction, vacuous when F 
is a field, can nevertheless be vital for more general operator rings F (particu- 
larly when their cardinals or characteristics are in question). Also, if we had 
chosen to define algebraic elements by means of (1), we could not have asserted 
that R being algebraic over F implies that R* is algebraic over F* (since some 
x 6 R might satisfy only equations (1) in which each coefficient a< € G ); 
however, using (2) ensures homomorphism-invariance for the algebraic 
property (since the identity element of F maps onto that of F*). 

In view of these remarks, it is not strictly true to say that every ring may 
be regarded as a ring over the ring I of integers: in fact this will be legitimate 
for a given ring R if and only if, for each positive integer k, an element x exists 
in R such that kx ^ 0. However, it is convenient and in practice not seriously 
confusing to be a little inexact in this connexion: we shall allow ourselves the 
customary liberty of regarding any ring R as a ring over I (rather than some 
quotient ring of I). Thus, for example, any algebra algebraic over a finite 
field of prime order will be regarded also as algebraic over the integers. 

Our definition of the algebraic property via (2), while fulfilling most 
reasonable requirements, does have the slight technical disadvantage of carry- 
ing with it no immediately available concept of a minimal polynomial; for, 
among the polynomials satisfying (2), there is in general more than one of 
minimal “lower degree” m (even if we demand that n — m be also minimal). 
However, at least when F is an integral domain, we can get something with 
most of the usual properties by returning to (1). 

Let R be a ring with arbitrary operators F, and x any element of R algebraic 
over F. Then x satisfies an equation of the form (2), and a fortiori satisfies 
equations of the form (1), i.e. there are non-zero polynomials /(X) over F, 
without constant terms, such that f(x) = 0. Among such polynomials /(X), 
all those of minimal degree (there will in general be several, possibly infinitely 
many) will be called minimal polynomials for x over F. We note two relevant 
lemmas; the first is standard and leads immediately to the second. 

Lemma 2.1. Let /(X), g{\) be arbitrary formal polynomials over a given 
commutative ring F, with leading terms aX", /3X* respectively. Then there are 
polynomials g(X), r(X) over F, with r(X) zero or of degree strictly less than n, 
such that 

a k g(\) = <?(A)/(X) + r(X). 

Lemma 2.2. Let x be any algebraic element of a given ring R over F, and let 
/(X) be any minimal polynomial for x over F, say with leading term aX*. Then, 
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given any polynomial g(\) over F, of degree k say, such that g(x) — 0, there is a 
Polynomial q{\) over F ( possibly with constant term ) such that 

a k g(\) = g(X)/(X). 

Of course Lemma 2.2 is of value only when we can be sure that a* 0. 
If F is an integral domain we even have some measure of uniqueness (“up to 
scalar factors”) for our minimal polynomials: for, if/, g are two such, with 
leading terms aX n , /5X*, then, by Lemma 2.2, polynomials p{\), q{\) exist such 
that a k g — qf, (Pf = pg, whence a k fi n f — pqf. Thus, for an integral domain F, 
since / is not identically zero, we have a k 0 n = p{\) q(X), and so p{\), q{\) 
must both be non-zero constants; in other words, any two minimal polynomials 
of a given element x algebraic over an integral domain must have a common 
non-zero scalar multiple. 


3. Some general properties of algebraic rings. Our first theorem (which 
will find a use later) is a direct adaptation of a result from elementary algebraic 
number theory: 

Theorem 3.1. Let F he a field, algebraic over a given subfield F 0 . Then every 
algebra algebraic over F is algebraic over F 0 . 

Proof. Let R be any ring over F, and x any element of R algebraic over F, 
say satisfying (2) above, with each a< £ F. Since F is a field, we can single 
out from the non-zero a< that one, say a Q , with greatest index q, multiply 
through by a q ~ l , and write 

x Q = Pix + . . . /Sj-ix 9 - 1 , 

where q > m > 1, each /3< 6 F. Hence, if we denote the field F 0 (/9x, . . . , @ q -i) 
by K , then the algebra i£[x] is finite-dimensional over K, while also K 
is itself a finite extension of F 0 (since each is algebraic over F 0 ). Thus K[x] 
can be regarded as a finite-dimensional algebra over F 0 (its dimension as such 
being given by dim(iC[x]: K) dim(iC: F 0 )), that is, x is algebraic over F 0 , as 
required. 

We come now to some of our principal results. 

Theorem 3.2. Let F be any ( commutative ) integral domain, not a field but 
having a unit element, and let R be any ring algebraic over F. Then, given any 
element x of R, any equation of the form 

(3) a m x m + a m+1 x m+1 -f • . • + oc n x n = 0 


with each 6 F and a m 9* 0 {and of course some such relation holds) implies 
the existence of a non-zero element a of F such that ax m = 0. 


Proof. We can write a m x m = x m a, where a is in the subring of R generated 
by x; then a m j x m = x m a j (j — 1 , 2, . . .), and so, taking j = m + 1 , we can 
find b £ R such that a m m+l x m = x m bx n 


a” + V* = 


Defining e = x m b, we then have 

m - f 1 


a* 


e = e 
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Now, for any 6 F, since R is algebraic, we can find a positive integer 
tp and a polynomial kp(\) over F such that (/3e) = ((3e) 'v+'kpifie). Also, by 

use of the relation e 2 = a m m+1 e, we can express e 2 kp(fie) — 9ge for some Op € F, 
so that /S'/Je'/s = 9pe l e, that is 

0 = - pd ff ) e* = - P6 g ) 

consequently, for each /3 £ F, 

j8*(l - (39p) ofi*” V = 0. 

Thus either a = jS'/sQ — /30 / j)a m (m+1) ‘u is non-zero for some /3 € F, as re- 
quired, or else — /30/s)a ra (m+1) = 0 for all /3 6 F; and in this latter case 
(since a m ^ 0 and F is an integral domain) we should have 1 = fiQg for each 
non-zero /3 ( F, contrary to our hypothesis that F is not a field. 

Corollary 3.1. Let R be any ring of characteristic zero. Then R is algebraic 
over the ring of integers if and only if R is nil. 

Proof. The ring of integers satisfies the conditions on F in Theorem 3.2, 
so, if R is algebraic over the integers, then, to each x € R, there correspond a 
non-zero integer a = a(x) and a positive integer m — m(x) such that ax m = 0; 
and, since R has characteristic zero, this implies that x m = 0, whence R is nil. 
The converse is obvious. 

Theorem 3.2 may be regarded as generalizing the known fact (7, Theorem 
11) that, if every element of a ring R satisfies x" (2) = x for some integer 
n(x) > 2, then every element of R has finite additive order; indeed, for any 
element x of a ring R satisfying this more stringent condition, and any ad- 
missible operator ring F, our argument shows that either an element a of F 
exists such that ax = 0, aR ^ 0, or F has the same property as R (so that, 
if F is an integral domain, it must be an algebraic field of prime characteristic). 

The argument of Theorem 3.2 can easily be modified to show that, with F, 
R as before, every regular element of i? h?s a non-zero annihilator in F. We 
can also, without appreciably more trouble, prove the following generalization 
of Theorem 3.2 (cf. 11 ): 

Theorem 3.3. Let F be any integral domain, R any ring over F, and x any 
element of R. Suppose also that there exists a non-zero element tt = ir(x) of F 
such that, to each element y of the subring of R generated by x, corresponds a 
non-zero polynomial g y (\) over F, whose lowest non-zero coefficient is not divisible 
by ir, such that g v (y) = 0. Then any equation of the form (3) with each a t € F 
and a m 0 implies that a non-zero element a of F exists satisfying ax m = 0. 

Proof. As in the proof of Theorem 3.2, we can find an element e of the 
subring generated by x such that a m m+x x m = ex m , a m m+l e = e 2 . By our hypo- 
thesis, for each fi 6 F, there is a polynomial over F of the form g0«(X) = 
ypX^ — \'ti+ l kg(\), with yg not divisible by ir, such that gp e (Pe) = 0. As 
before, we deduce that, for each j8 ( F, an element 9p of F exists such that 

e' s (yt - ms) <4r +1, V‘ = 0 . 
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Finally, taking /3 = tt,. since r 0, a m 0 and since y r is not divisible by ir, 
we can be sure that a = Tr‘*(y r — 7r0 T )a„ l (m+1)t * 0. 

It is hardly necessary to mention that the existence of an element ir of F 
satisfying the conditions of Theorem 3.3 ensures that F cannot be a field. 
As a corollary of Theorem 3.2 itself (or more generally of Theorem 3.3) it is 
obvious that any minimal polynomial of x must have the monomial form 
a\ n . This is not difficult to see even under a hypothesis substantially weaker 
than that all elements of the subring generated by x be algebraic, as we 
show next : 

Theorem 3.4. Let F be any integral domain , not a field but having a unit 
element , let R be any ring over F, and x a given element of R. Then , if every 
F-multiple yx of x is algebraic over F, and if 

H X) = oc m \ m + <WiX m+1 + . . . + « n X B , 

with a m 5^0 , a n 0, is a given minimal polynomial for x over F, we must have 
m = n (so that a m x m — 0). 

Proof. Suppose by way of contradiction that m n, that is, n — m > 1, 
and let /3 be an arbitrary non-zero element of F (fixed throughout the ensuing 
argument). Then 

0 = pd-'Hx) = 

+ • • • + Pot n -i(a n (3x) n 1 + (a n 0x) n , 

and so, defining y — aJ3x and 

r/\ \ o n ~ m n— «— 1 -v m . a n—m—l n—m — 2 m+1 . , Q ,n-l , x n 

/(X) = P a n a m \ + p a n a m+ iX + . . . + jSa B _jX +X, 

we have/(y) = 0. Indeed, /(X) is a minimal polynomial for y (since, F being 
an integral domain and a„/3 being non-zero, if y satisfied an equation of lower 
degree, so would x). 

Now, y being an F-multiple of x, our hypothesis assures us of the existence 
of a positive integer t and a polynomial k(\) over F such that y l = y t+l k(y). 
Thus, by Lemma 2.2 (with a = 1), there is a polynomial q(X) over F such 
that 

X* - X‘ +1 fc(X) =/(X)<z(X) 

identically. Since a m ^ 0, comparison of coefficients of X 1 on either side gives 

1 = /3 n ~ m a n H ~ m ~ 1 a m ^, where £ = & is the lowest non -zero coefficient of g(X), 
that is 1 = (300 , where 

n nft — Tfl — 1 fl — Itl — 1 y 

00 — P Ol n 0- 

But, since /3 was an arbitrary non-zero element of F, this would contradict 
our hypothesis that F is not a field; thus in fact m = n, as required. 

There is naturally an extension of Theorem 3.4 along the lines of Theorem 
3.3, but we shall not state it formally. However, we note the (trivial and 
known) corollary that, if z is a complex number such that z/0 is an algebraic 
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integer (in the usual number-theoretic sense) for every positive integer 0, 
then z = 0; to see this, one has only to suppose the contrary, and take x = \/z 
in Theorem 3.4. 

4. Diagonable rings. On being given any positive integer q and on writing 
1„ for the unit q X q matrix, it is customary to call a q X q matrix x, with 
elements in a given field F, diagonable over F if distinct elements fr, . . . , 0, 
of F exist such that 

( x - /3il c ) ... (x - /3,lg) = 0 

(where 5 can be any positive integer). There are several well-known alternative 
forms for this definition (e.g. in terms of the existence of a non-singular 
q X q matrix b over F such that b~ l xb is diagonal). We shall adopt the following 
(obviously equivalent) form: x is diagonable over F if and only if there are 
distinct elements 71, ...» 7* of F such that 

(4) x(y x x + 1«) . . . (7 1* + l c ) = 0. 

It will be noted that we have not required F to be algebraically closed; indeed, 
our definition remains significant for any ( commutative ) ring F. Further, 
since the unit matrix l ff now occurs only in a purely formal way (i.e. can be 
got rid of by multiplying out the factors in (4)), we may apply the definition 
to any ring R admitting the operators F (i.e. not merely to rings of square 
matrices over F). If every element of a ring R over F is diagonable over F, 
we shall say that R is itself diagonable over F. Obviously every diagonable 
ring over F is algebraic over F. 

Motzkin and Taussky (10) showed that, if x, y are given q X q matrices 
over an algebraically closed field F, and if also ax + /3 y is diagonable over F 
for all choices of a, /3 in F, then xy = yx (whence it is easy to deduce the 
existence of a non-singular qXq matrix b over F reducing x and y simultaneously 
to diagonal forms b~'xb, b~ l yb ). Their proof (a geometrical one) is long; and, 
since hypotheses are made only about the F-module generated by x and y, 
ring-theoretic methods are perhaps not very suitable for dealing with the 
problem. However, if we are prepared to extend the diagonability hypothesis 
to all “non-commutative polynomials” in x and y, then the proof that x, y 
commute becomes almost trivial; indeed, for rings with arbitrary operators, 
we shall show in our next theorem that diagonability always implies commuta- 
tivity. The proof depends on a familiar property of strongly regular rings; 
for completeness, we first derive this property, and indeed something more 
general, in the following lemma (which will in any case be needed later on 
in §6): 

Lemma 4.1. Let R be any ring in which, to each pair of elements x, y, there 
corresponds a non-negative integer r such that xy T is in the right ideal of R gen- 
erated ( over the given operator ring F) by y and x 2 . Then, if J denotes the Jacobson 
radical of R, R/J is a subdirect sum of division rings. 
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Proof. We know from Jacobson’s structure theory that R/J is a subdirect 
sum of primitive rings, each of which is a homomorphic image of R/J and hence 
of R; and each of these primitive rings inherits the (clearly homomorphism- 
transitive) hypothesis on R. Thus it will be enough to show that if R is itself 
primitive then R must be a division ring. 

To call R primitive is the same as to say that R is isomorphic with a dense 
ring M of linear transformations of a vector space V over a division ring D. 
We shall denote the result of operating on v € V with x £ M by vx (i.e. 
regard M and D as operating on V from the right), and have only to show that 
V cannot contain two elements Vi, v 2 independent with respect to D. But, in 
the contrary case, since M is dense, we could choose x, y in M so that 

v\X = v 2 , v 2 x = 0, V\y = 0, v 2 y = v 2 \ 
then, for any a, & £ F, any a, b £ R, and any non-negative integer r, 

V\{xy r — ax 2 — x 2 a — (3y — yb ) = v\ x(y T — ax — xa ) — 0 = v 2 y r = v 2 ^ 0 

(by the ^-independence of v\, v 2 ). But our hypothesis on R asserts that, x, y 
being chosen, we can find a, /?, a , b, r such that xy T — ax 2 — x 2 a — (3y — yb = 0 ; 
thus we have our desired contradiction. 

Theorem 4.1. Every diagonable ring is commutative. 

Proof. Given any element x of a diagonable ring R, then, on taking yi, ... ,y t 
as in equation (4) above and on writing 

( 71 X -f- 1) . . . (7 A + 1) = 1 — Xg(X), 

g(\) is a polynomial over F (possibly with constant term), and x = x 2 h(x), 
where h(\) = Xg 2 (X) is a polynomial over F without constant term (so that 
h(x) is well-defined). Thus, given any x £ R, we can find an element a — h(x) 
of R such that x = x 2 a. In other words, every diagonable ring R is strongly 
regular and hence semi-simple in Jacobson’s sense, and so, by Lemma 4.1 
(with r = 0), R is a subdirect sum of division rings, each of which is a homo- 
morphic image of R and consequently diagonable. But a diagonable division 
ring is obviously commutative, so we deduce that R must in fact be a subdirect 
sum of fields. 

5. Additive functions on x-regular rings. We recall (cf. 8) that an 
element x of a ring R is said to be x -regular in R if a positive integer s = $(x) 
and an element b = b(x) of R exist satisfying x* = x*bx*. Given any elements 
x, y of a ring, we shall use [jc, y] to denote their additive commutator xy — yx. 

Theorem 5.1. Let R be any ring , let © be any given set with a transitive 
binary relation < defined on it, and let ^ be any set of mappings of R into ©. 
Then, if we denote by M(x) the result of operating on a typical element x of R 
by a typical element M of the statement (i) to each choice of x in R and M in 
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^ there correspond c £ R, N € ^ and an integer t > 2 such that [x‘, c] = 0 
and M(x) < N(x l c), implies (ii) M(z) < M(0) for every M € ^ and every 
nilpotent element z of R. 

Conversely, if (ii) holds, then (iii) for any given T-regular element x of R, say 
with x * = x‘bx s , we have M(x — x i+1 5) < M(0) for all M 6 


Proof. Suppose first that (i) holds. Then, given any x = Xo in R and any 
M — M 0 in we can find a sequence of integers tj > 2, a sequence c } of 
elements of R and a sequence of mappings M, 6 ^ (j — 1, 2, . . .) such that 

xj = XjLiCj, [xjii, cj] = 0, Mj-i (*/_i) < Mj(xj) (J = 1, 2, . . .). 

Since < is transitive on M{x) = Mo{xo) < Mj(xj) (j — 1, 2, . . .), while 
a simple induction argument shows that 


Xj = x " - V 


b r tt- 
C 2 


Cj-lCj 


0 '= 1 , 2 ,...); 


combining these two remarks we obtain (ii) on taking j sufficiently large. 

To prove that (ii) implies (iii) we notice that (x — x* +1 b)* can be written 
in the form x * + x s d (for a suitably chosen d € R), so that 


(x - x'+'b)* 1 = (x - x t+1 b ) x s + (x- x’ +1 b) x * . d; 


also, if x $ = x s bx *, then 


(* - x* l b) x* = a:* 4 - 1 - x' +1 bx* = x* 1 - x.x' = 0, 


.*+i 


*+ij 


.»+i 


so that x — x s+1 b is nilpotent, and (ii) gives M(x — x ,+l b) < M{ 0). 

It is perhaps worth remarking that, if x* = x*bx s and we write 5 = 2r — 1+5, 
where r is a positive integer and 5 = 0 or 1, then one can show (only slightly 
less easily than in the second part of the proof above) that (* — jc r+ ®6x r )* = 0, 
so that (ii) also implies M{x — x T+i bx T ) < however, this fact seems to 

be less useful in applications. 

We have set out Theorem 5.1 in the very general (and accordingly rather 
bogus-looking) form above in order to highlight the essential argument, 
which will be successively more and more obscured in our next theorems 
(where we return to earth, and make the “converse” more worthy of the name, 
by specializing <25, We shall mean by an additive function on R any 

mapping, say /: x — > f(x), of R into itself such that f(x + y) = f(jx) +/(y) 
for all x, y 6 R’, in particular, /(0) = 0. We do not require that /(ax) = af(x) 
for admissible operators a. 


Theorem 5.2. Let R be any ring, and ^ any set of additive functions on R. 
Then the statement (i) to each choice of x in R and f in & there correspond 
c € R, g £ <&and an integer t > 2 such that [x‘, c] = 0 and such that /(x) is in 
the two-sided ideal of R generated by g(x‘c), implies (ii)/(z) = 0 for every f 6 & 
and every nilpotent element z of R. 

Conversely, if (ii) holds, then (iii) for any given ir-regular element x of R, say 
with x* = x’bx 3 , we have /(x) = f(x t+1 b) for every f 6 
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Proof. For any x 6 R, f € let M /(x) denote the two-sided ideal of R 
generated by the element /(x) of R. Then Theorem 5.2 is just the special case 
of Theorem 5.1 with © chosen as the set of all two-sided ideals of R, ordered 
in the natural way by inclusion, and with •Jf chosen as the set of all mappings 
M f \ x — ► M f (x). 

We recall that an element x of a ring R is said to be strongly regular in R 
if an element a = a(x) of R exists such that x = x 2 a. 

Theorem 5.3. Every x -regular ring without non-zero nilpotent elements is 
strongly regular. 

Proof. This follows at once from the second part of Theorem 5.2 on taking 

to consist of the single function f:x—*f(x) = x. Alternatively and more 
directly, going back to the proof of Theorem 5.1, we have merely to observe 
that x * = x'bx* implies ( x — jc* + 1 6) ,+1 = 0, so that, if R has no non-zero 
nilpotent elements, then x — x ,+l b = 0, that is 

x = x t (x t ~ l b). 

Conversely, if R is strongly regular, then (independently of the 7r-regularity 
hypothesis) of course R can obtain no non-zero nilpotent element. Thus we 
see that, among x-regular rings, the property of having no non-zero nilpotent 
elements is homomorphism-invariant. 

From now on all we shall need of what has already been proved in this 
section is the following consequence of Theorem 5.2: 

THEOREM 5.4. For any ring R, the statement (i) to each choice of x, y in R 
there correspond c 6 R and an integer t > 2 such that [x‘, c] = [x — x*c, y] — 0, 
implies (ii) every nilpotent element of R is central. 

Conversely, if (ii) holds, and x is any given x-regular element of R, say with 
x * = x'bx 1 , then (i) holds, for this x and all y, with c = b and t = 2 if s = 1, 
and with c — xb and t = s otherwise. 

Proof. The first part is essentially the special case of the corresponding 
part of Theorem 5.2 with & chosen as the set of “commutator functions” 
/,:*—► [x, y] (one such function being associated with each y £ R)’, indeed, 
we have thrown away some generality elsewhere by writing [x — x‘c, y] = 0 
in (i) rather than the weaker “for some z in R, [x, y] lies in the two-sided ideal 
of R generated by [x f c, z]. M 

To prove the converse, we quote from (2) that (ii) implies that every 
idempotent element e of R is central. Taking e=x'b, we deduce that x*=x 2a b-, 
in a similar way, we see that x* = 6x 2 *. Hence 

x*b = bx 2> . b = b . x u b — bx'\ 

also, by the converse part of Theorem 5.2, x — x ,+1 b is central, so the proof 
is complete. 
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In particular, we have proved that (i) and (ii) are equivalent in any 
x-regular ring. For the special case of rings algebraic over the integers, the 
two parts of Theorem 5.4 are implicit in Herstein’s papers (5, Lemma 3; 
6) respectively. We note also the following immediate consequence of Theorem 
5.4: 

Corollary 5.1. Among ir-regular rings , the property of having all nilpotent 
elements central is preserved under homomorphism ( even under homomorphisms 
which do not commute with the given operators). 

This corollary cannot of course be extended to arbitrary rings (consider for 
example the free ring R generated over the integers by two non-commutative 
indeterminates, and the natural homomorphism of R onto i?/(4R)). 

Combining Herstein’s theorem (quoted in the Introduction) with the 
converse part of Theorem 5.4, we have (since every algebraic ring is clearly 
7r-regular) 

Theorem 5.5. Let Rbe a given ring algebraic over the integers ( or any quotient 
ring), and suppose that every nilpotent element of R is central. Then R is commuta- 
tive. 

This was previously pointed out by Herstein (5), and generalizes a result of 
Arens and Kaplansky (1, Theorem 4.2). They proved commutativity for 
any ring R, necessarily without non-zero nilpotent elements, in which each 
element x has finite non-zero additive order and satisfies an equation of the 
form 

(1) «!* + ...+ a n x n = 0, 

with integral and a n x n 5^ 0. For in these circumstances every 

element has squarefree characteristic, so that R is the restricted direct sum 
of R( P ), where p takes all prime values and R (p ) denotes the set of all * € R 
with px = 0; and it is easy to see that each i? (p) is algebraic over the integers 
(in our sense) and without non-zero nilpotent elements. Restrictions on the 
additive orders of elements of R are no longer in evidence in the statement of 
Theorem 5.5; however, Theorem 3.2 shows that this aspect of generalization 
of the result of Arens and Kaplansky is illusory. 

We should naturally like to have something similar to Theorem 5.5 valid 
for rings with more general operators than the integers. Such a generalization 
would of course follow for a given operator ring F if Herstein’s theorem could 
be extended to allow elements of F as coefficients in p z (\). Consideration of 
the quaternion algebra over the reals sets a limit on such hopes, but, by 
Theorems 3.1 and 5.5, we do have at least the following generalization of 
Jacobson’s result (7, Theorem 9) mentioned in the Introduction: 

Theorem 5.6. Let F be any field of non-zero characteristic algebraic over its 
prime subfield (in particular, any finite field). Then, if a given algebra R algebraic 
over F has all its nilpotent elements central, R is commutative. 
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It will be noted that the hypotheses on F imply that F is a perfect field; 
however, the quaternions show that the result does not hold for all perfect 
fields F. 

We note also the following analogous, and more elementary, result: 

Theorem 5.7. Let F be any algebraically closed field. Then, if a given algebra 
R algebraic over F has all its nilpotent elements central, R is commutative. 

Proof. Given any element x of R, then, since R is algebraic, x generates a 
finite-dimensional subalgebra over F, and, since F is algebraically closed, 
consequently, by the theory of the classical canonical form, we can write 

* = /+ £«<«<, 

t 

where / is a nilpotent (and hence central) element of R, the e t are idempotent 
elements of R, and the a< are in F. Thus, to prove R commutative, it would be 
enough to show that all idempotent elements of R commute with one another. 
But in fact, by (2) again, the hypothesis that all nilpotent elements are 
central implies (in any ring) that every idempotent element is central, so 
the result follows. 

6. H-rings. We now turn to some questions arising from Herstein’s theorem. 
Herstein’s method of proof was to settle first the division ring case (which he 
succeeded in doing by a comparatively short argument), and then to show 
(by a rather lengthy sequence of lemmas) how the result for arbitrary rings 
can be reduced to this special case. Herstein has conjectured (in a letter to 
the writer) that if, to each element x of a given ring R, there corresponds an 
element a of R such that x — x 2 a is central, then R is a subdirect sum of a 
commutative ring and a (possibly vacuous) set of division rings; we shall refer 
to this as Herstein' s conjecture. 

This conjecture can reasonably be thought of as generalizing Herstein’s 
theorem, since any division ring D occurring as a subdirect summand of R is 
necessarily a homomorphic image of R, so that, if a is always a polynomial in 
the x € R to which it corresponds, then a similar statement holds for D 
(while, as we have noted, the division ring case of Herstein’s theorem takes 
up only a small part of the proof). Further, the conjecture, if true, would 
have over the theorem the advantage that its (much weaker) hypothesis 
does not involve any restriction on the operators, whereas the quaternion 
ring shows that the theorem as originally stated definitely does not extend 
to rings with arbitrary operators (rather than the integers). Thus the con- 
jecture embodies as much as one could hope to be true in the general case 
and also, essentially, in the case of integer operators first considered by 
Herstein; if the conjecture could be substantiated, the theorem (and most of 
its subsequent ramifications) could be deduced from it in a comparatively 
trivial way. 
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We shall in fact consider here only the case in which x 2 a = x l c = cx‘, 
where c € R and / is some integer with / > 2, but we can afford to weaken 
the centrality condition slightly. Formally, we call a given ring R an H-ring 
if, to each pair x, y in R, there correspond c = c{x, y) £ R and an integer 
t — t{x,y) >2 such that 

[x\ c] = [x — x*c, y] = 0. 

Certain of Herstein’s arguments can be straightforwardly generalized to apply 
to these rings; since every division ring is an jFf-ring (e.g. with c = x~ l for 
x 0 and otherwise arbitrary) we cannot hope to prove all JT-rings commuta- 
tive, and we shall be chiefly concerned with side-conditions sufficient to ensure 
commutativity (cf. Theorem 4.1 above). Our next result shows that all H- rings 
have a certain property which would follow as an immediate consequence of 
the truth of Herstein’s conjecture; and, conversely, that, when we restrict 
attention to 7r-regular rings, this property actually characterizes the If-rings: 

Theorem 6.1. Every H-ring has all its nilpotent elements central. Conversely, 
if a given ring R is -k - regular and all its nilpotent elements are central, then R 
is an H-ring and c, t can be chosen independently of y; also, if R is algebraic 
{over some given ring F of operators), then, corresponding to each x € R, there is 
a polynomial p x {\) over F such that x — x 2 p x (x) is central. 

This is, essentially, just a partial restatement of a special case of Theorem 5.4 
in H-ring terminology. 

Extending slightly concepts which have been used by Goldhaber and 
Whaples (4) and by McLaughlin and Rosenberg (9), we shall say that a 
commutative ring F is quasi-algebraically closed if every division ring algebraic 
over F (or over a factor ring of F ) is commutative. Obviously every algebrai- 
cally closed field is quasi-algebraically closed; and, by Theorems 5.5 and 5.6, 
the property of being quasi-algebraically closed is also shared by the ring of 
integers (with all its quotient rings), and by every finite field. 

If, in Lemma 4.1, F is quasi-algebraically closed and R is algebraic over F, 
then clearly R/J is a subdirect sum of fields, and so (since J is nil in any 
ir-regular ring) R is commutator-nil, i.e. the two-sided ideal of R generated 
by all the commutators [x, y] with x, y £ R is a nil ideal; 1 and clearly every 
H- ring satisfies the hypothesis of Lemma 4.1 (with r = 1). If Herstein’s 
conjecture were true, we should even have commutativity for all algebraic 
H-rings over quasi-algebraically closed operator rings F. Not every algebraic 
H- ring is commutative (consider again the quaternions), but, by combining 
Theorem 6.1 with Herstein’s theorem, and also with Theorems 3.1 and 5.7, 
we find 

‘Clearly this conclusion still holds good even if R is given as only x-regular (rather than 
actually algebraic) provided that every division ring over Fis commutative; a variety of anal- 
ogous results can be obtained by weakening the hypothesis on either R or Fand correspondingly 
strengthening the hypothesis on the other. 
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Theorem 6.2. Every H-ring algebraic over the integers, or over any finite or 
algebraically closed field, is commutative. 

More generally, if Herstein’s theorem could be extended to allow given 
operators F as coefficients in p x (X), then we could similarly show that every 
H-ring algebraic over this particular F is commutative. Commutativity 
(and hence local finiteness) would then of course follow for every division ring 
algebraic over F; and this is the same as to say that F is quasi-algebraically 
closed. Thus Herstein’s theorem definitely cannot be extended to any non- 
quasi-algebraically closed F. 

Without prejudging how far Herstein’s theorem does extend, or whether 
his conjecture is in fact true, we can at least show that every H-ring algebraic 
over a quasi-algebraically closed field F must be locally finite. For the algebraic 
condition on R makes J nil and consequently (by Theorem 6.1) central, 
while we have previously seen (from Lemma 4.1) that R/J must be commuta- 
tive. Then R/J and J, being commutative algebraic algebras over F, are both 
locally finite over F, whence, by (7, Theorem 15), R is itself locally finite, as 
we asserted. 

Now commutativity for R is equivalent to that of all its doubly generated 
subrings; also these are finite-dimensional over F by what we have just proved, 
and are H-rings in view of the last part of Theorem 6.1. Thus, to prove com- 
mutativity for all H-rings algebraic over a given quasi-algebraically closed 
field F, it is enough to do so only for finite-dimensional R; this is easy when 
F is also perfect (and, more generally, whenever R can be expressed as a 
supplementary sum R = S J with S = R/J)- 
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S. A. AMITSUR 

Introduction. Let R be a ring and let i?[x] be the ring of all polynomials 
in a commutative indeterminate x over R. Let J(R) denote the Jacobson 
radical (5) of the ring R and let L(R) be the lower radical (4) of R. The main 
object of the present note is to determine the radicals /(i?[x]) and L(R[x]). 
The Jacobson radical /(i?[x]) is shown to be a polynomial ring iV[jc] over a 
nil ideal N of R and the lower radical L(i?[x]) is the polynomial ring L (_/?)[*]. 
A partial result of the first case and a parallel solution to the second case have 
been obtained also independently and by different methods by N. H. McCoy 
(simultaneously with the author). 

The present method of attacking these problems can be applied to many 
other radicals arising from ^-properties (1) of rings. Let tt(R) denote the 
x-radical of a ring R. J(R) is an example of a radical satisfying n-(R[x]) = P[x] 
where P — r (R[x])C\R, and L(R) represents a class of radicals satisfying 
x(R[x]) = 7T (J?) [ac]. The results obtained can be easily extended to poly- 
nomials in any number of variables. 

It is shown that /(i?[x]) = iV[x] where N is a nil ideal in R. Snapper, who 
studied the Jacobson radical of polynomial rings over commutative rings R, 
has shown (7) that N is the maximal nil ideal in R. The extension of this result 
to arbitrary rings seems to be very difficult. Though we verify this fact for 
algebras over non-den umerable fields, the general problem of determining the 
ideal N remains open. 

1. The Jacobson radical 

Lemma 1 J : Let N = 7(i?[x]) C\ R, then J(R[x]) 5 ^ 0 implies N 7 * 0. 

The proof of this Lemma, which is a keystone in the extension of the results 
on the Jacobson radical to arbitrary radicals, seems to be rather elementary 
if R is an algebra over an infinite field, or if R is of characteristic zero; but the 
proof is far more complicated in the general case. 

Recall that the Jacobson radical is a radical of the type dealt in (1). In 
particular, it follows by Corollary 1.1 of (1) that J(R[x]) remains invariant 
under the automorphisms of i?[x]. For example, consider the automorphism 1 : 
/(x) — >/(x 4- 1) of R[x], or more generally the automorphism :/(x) — >/(x4-X), 
where X is an endomorphism of the additive group of R satisfying \{ah) = 
(Xa)6 = a(\b) for a, b 6 R- 

Put J = 7(i?[x]). If the Lemma is not true, then we have a case where 
J 7 * 0 but J C\ R — N = 0. Let f(x) be a non zero polynomial of minimum 

Received July 25, 1955. 

x One does not need to assume that R contains a unit. 
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degree belonging to J. By the previous remarks it follows also that/(x + 1) € J. 
Hence / 0 (x) = fix -f 1) — fix) 6 J since the degree of / 0 (x ) is less than that 
of fix). The minimality of the latter implies that / 0 (x) =0. Thus/(x-f-l) =f(x). 

If R is of characteristic zero, then one readily verifies that f(x + 1) = fix) 
can hold only if fix) = a € R . Thus 0 a £ J R which is a contradiction. 
Another immediate contradiction is readily obtained if R is assumed to be an 
algebra over an infinite field F. Indeed, the preceding arguments can be 
applied as well to the automorphisms: g(x) —■ ► g(x + X), X £ F, of i?[x]. 
This yields that fix + X) = f{x) for all X £ F. Since F is infinite, the last 
relation implies that/(x) = a 6 R, and thus 0 ^ a £ N which contradicts 
the assumption N = 0. 

In order to obtain a contradiction in the general case we have to make some 
additional remarks: Let p be a prime number and let R p be the set of all 
elements of R which are of characteristic p. Note that R is an ideal in R and, 
therefore -R P [x] is an ideal in i?[x]. We may assume that/(x) 6 i?„[x]. Indeed, 
let /( x) = aoX n + . . . + a n . Since N — 0 it follows that n > 1. Hence, since 
f{x + 1) — fix) = naoX^~ l + • • . = 0 we obtain that nao = 0. Let m be 
the minimal integer satisfying ma Q — 0 and let p be a prime dividing m. 
Thus im/p)ao7*0 and clearly im/p)a 0 € Ro ■ We may replace fix) by the poly- 
nomial im/p)fi x) which belongs also to J and which is of the same degree as 
fix). Namely, we suppose that the highest coefficient of fix) belongs to R p . 
Now pfix) 6 J and its degree is smaller than the degree of fix), hence the 
minimality of the latter yields pfix) = 0, that is, fix) £ 

Next we show that if a polynomial g(x) 6 R P [x] satisfies gix + 1) = g(*) 
then g(^r) = hix v — x) is a polynomial in x v — x with coefficients in R p . The 
proof is carried out by induction on the degree of gix). First, let gix) be a 
polynomial of degree k < p. Since gix + 1) = £(*) it follows that gix + v) = 
gix) for all integers v. Now, 

gix + v) = giv) + xg^) -f . . . + x n g m iv) = b 0 + xbi + . . . + x m b m = gix). 

Hence, g(v) = b 0 for all integers v. Clearly, R P is an algebra over the finite 
field GF[p] of p elements. Thus we obtained that gix) — b 0 vanishes for p 
elements of the field GF[p\. Since the degree of gix) — bo is less than p, it 
follows that g(x) — b 0 = 0, that is, gix) = b 0 € R P . 

Let gix) be a polynomial of arbitrary degree, 2 then g(x) = hix)ix p — x) + 
&(x), with the degree of &(x) < p. Hence, g(x + 1) = hix + l)(x p — x) + 
kix + 1) and since g(x + 1) = g(x) we obtain 

[ft(x + 1) — hix)]ix v — x) = &(x) — kix -f 1). 

The degree of the right-hand side of the last equality is less than p and the 
degree of the left-hand side, if not zero, is >/>. It follows, therefore, that 

2 This holds in the ring R* v [x], where R* is obtained by adjoining a unit to R, but, clearly, 
g(x) and k(x) belong to 
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k(x) = k(x + 1) and h(x 4- 1) = Ms). By the previous case we know that 
k(x) = ko € R p and by induction it follows that h(x) — ho(x p — x). Thus 
g(x) = ho(x p — x)(x p — x) 4* ko is a polynomial in 2? p [x p — x]. 

The last preparatory remark we need before completing the proof of the 
Lemma is to the effect that if a polynomial h(x p — x) belongs to the Jacobson 
radical of R p [x), then it belongs also to the Jacobson radical of R v [x p — x]. 
Indeed, let k(x) £ h(x v — x) 7? p [x p — x], then k(x -f 1) = k(x). Clearly 
k(x) belongs to the Jacobson radical of i? p [x] hence its quasi-inverse k' (x) is 
uniquely determined. The quasi-inverse of k(x + 1) is readily seen to be 
k'(x -j- 1); hence k(x) = k(x + 1) implies that k'(x + 1) = k'(x). Con- 
sequently k'(x) £ R P [x p — x]. This proves that the right ideal h(x p — x) 
R p [x p — x] is quasi-regular in R p [x p — x]. Thus h(x p — x) belongs to the 
Jacobson radical of R p [x p — x]. 

We turn now to the proof of the Lemma. Since f(x) 6 J (~\ R p [x] and 
i? P [x] is an ideal in i?[x], it follows that/(x) € J(i? p [x]) = jR p [x] C\J. It was 
shown that /(x -f- 1) = /(x), hence /(x) £ R p [x p — x\. Thus by the previous 
remarks, it follows that/(x) = g(x p — x) and g(x p — x) belongs to the Jacob- 
son radical /(i? p [x p — x]). The mapping h (x) — > h (x p — x) determines an 
automorphism between R p [x ] and R p [x p — x]. It follows now, by Theorem 
1.7 of (1), that J(/? p [x]) is the image of J(R p [x p — x]). In particular, it follows 
that since g(x p — x) = /(x) € J(R p [x p — x]), g(x) 6 /(i? p [x]). But g(x) is of 
lower degree 3 than /(x); hence, g(x) 6 /(i? p [x]) = J R p [x] implies that 
g(x) 6 J , which contradicts the minimality of/(x). This completes the proof 
of the Lemma. 

Lemma 2 J : J(i?[x]) = iV[x], where N = /(i?[x]) C\ R. 

Indeed, since N Q J = J(R[x]), it follows that N[x] 2?[x] C iV7?[x] C J. 
Hence iV[x] Q J. Consider the homomorphism: jR[x] — > i?[x]/iV[x]. The 
kernel of this homomorphism is 7V[x] C J. It follows, therefore, by Theorem 
1.7 of (1) that /(£[*]/#[*]) = J/N[x]. Let R - R/N, then R[x]/N[x]&R[x). 
Now: 

/(«[*]) n r ^ j/n[x] n (r, iv[x])/iv[x] = (jn (r, w[x]))/jv[x] 

= (J O R, N[x])/N[x] = (N, W[x])/iV[x] = 0 

since J 3 iV[x]. Hence, Lemma 1 implies that 0 = /(ft[x]) = J/N[x\. 
Consequently, J = iV[x], as required. 

It remains now to determine the structure of the ideal N. 

Lemma 3J : N is a nil ideal in R. 

Clearly, N is an ideal in R. Let r £ N C J, then r.rx = r 2 x € /. Let q(x) 
be the quasi-inverse of r 2 x, that is, q{x) + r 2 x + q{x)r 2 x = 0. In other words 

q(x) = — r 2 x — q(pc)r 2 x. 

*This is true since the degree of /(*) is 1. 
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Substitute q(x ) on the right by the whole expression of the right-hand side 
of this equality. Repeating this process yields 

q(x) = — r*x + (r 2 *) 3 + • • • 4* ( — l) R (r 2 ic)” -f- (— l) n+1 (r 2 *) n+1 

+ (-l)* +, g(*) (r 2 x) n+l . 

Choose n to be greater than the degree of q(x). Equating the coefficient of jc* 
on both sides yields that r 2n = 0. This proves that N is a nil ideal. 

Levitzki’s locally nilpotent radical s<r(R) of a ring R is defined (2, p. 130) as 
the maximal ideal of R with the property that its finitely generated subrings 
are nilpotent. One readily observes that the polynomial ring so-(R)[:e] is a nil 
ideal and, therefore, it also is quasi-regular. Consequently 5<r(R)[:c] C J, 
and thus so(R) C N. Summarizing the results obtained, we have 

Theorem 1. L(i?[x]) = A^[ac] where N = /(R[xj) R is a nil ideal con- 
taining the locally nilpotent radical so{R) of R. 

Remark. If R is commutative, or more generally satisfies a polynomial 
identity, then it is known (6) that the nil ideals of R are locally nilpoint ideals. 
Thus in this case N C sa(R), and therefore, 7(i?[»]) is a nil ideal and N is 
the maximal nil ideal of R. 

We restrict ourselves now to the case where R is an algebra over an infinite 
field F. An ideal I in an algebra R is called an LB /-ideal (3) if I is a nil ideal 
and every finitely generated submodule of I is of bounded index. One readily 
observes that if f(x) 6 ![*], where I is an LSI-ideal, then f(x) is nilpotent 
and its index is bounded by the index of the submodule of I generated by the 
coefficient of fix). Thus, I[x] C J. The maximal LSI-ideal, LBI{R), of R 
is known (3) as the LSI-radical of R. Hence, the preceding arguments yield, 
in view of the fact that LBI(R) 3 s<r(R), that: 

Corollary. N 3 LBI(R) 3 sa(R): 

It was shown in (3) that if R is an algebra over a non denumerable field F , 
then every nil ideal in R is an LSI-ideal. Consequently, for such algebras 
LBI(R) 3 N, which in particular implies that Af[x] is a nil ideal. Since the 
nil ideals are quasi-regular, it follows that: 

Theorem 2. If R is an algebra over a non denumerable field F, then the 
Jacobson radical J(R[x]) = 7V[x] is the maximal nil ideal of S[x], and N is the 
maximal nil ideal of R. 

One conjectures that in all cases J(JR[x]) is the maximal nil ideal of R. 
This would follow immediately if one could supply a positive answer to the 
still-open problem of Levitzki which requires to show that every nil ring is 
locally nilpotent, since in that case N — s<r(R) will hold for every ring. 

2. The lower radical. Let L(R) denote the lower radical of the ring R. 

From the results of (2, Corollary 2.2), we know that the lower radical arises 
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from a property L of rings. Recall that a ring R is an L- ring if every non zero 
homomorphic image of R contains non zero nilpotent ideals. The property L 
satisfies the same requirements of (1) as the property of quasi-regularity: 
namely, L( = or* in the notations of (1)) is an SRZ-property of rings. We have 
also in this case: 

Lemma 1L: Let L — (/?[*]) C\ R; then L(R[jc]) 9 ^ 0 implies L ^ 0. 

Lemma 2L: L(i?[x]) = L[x). 

The proof of the two Lemmas follows in parallel lines the proof of Lemma 1J 
and Lemma 2 J , except that at one place in the proof of Lemma 1J we have 
used the definition of quasi-regularity and not the general requirements of an 
SR property. The proof of this point for the lower radical is what remains to 
complete the proof of the present two Lemmas. That is: we have only to show 
that “if f(x) 6 R v [x v — x] belongs to the lower radical of i? p [x], then it belongs 
also to the lower radical of R P [x p — x].” Indeed, the ideal generated by f(x) 
in R p [x p — x] is a subring of the ideal generated by f(x) in /?„[*]. The latter 
is an L-ideal, since f(x) 6 L(R p [x\). By Corollary 2.2 of (2), it follows that 
subrings of L-rings are L-rings; hence the ideal generated by/(x) in R p [x p — x] 
is also an L-ideal. Consequently, f(x) 6 L(R p [x p — jc]). This completes the 
proof of Lemma 1L and, therefore, also of Lemma 2L. 

In parallel to Lemma 3J, one has to characterize the ideal L. In the present 
case we can show that L = L(R). 

Theorem 3. L(i?[x]) = L(i?)[x]. 

Indeed, since L = L(R[x]) r\ R Q L(R[x]) and L is an ideal in R, it follows 
by Corollary 2.2 of (2) that L is an L-ideal in R. Hence L C L(R). The 
converse L(R) C L will follow immediately from the following: 

Lemma 3L. If S is an L-ring then 5[jc] is also an L-ring. 

Indeed, let 5[x] —►£[*] be a homomorphism of S[x] onto a ring 5[x] . This 
homomorphism induces a homomorphism of 5 onto a ring § Q ,5[jc]. If x 
denotes the image of x, then clearly 5[.r] = S[x]. Thus if S\x\ j* 0 then S ^ 0. 
Since 5 is an L-ring, S c onta ins a non zero nilpotent ideal Q. Consequently, Q[x\ 
is a nilpotent ideal of 5[x], which proves that S[x] is also a ring. 

To complete the proof of Theorem 3, we note that Lemma 3L implies that 
L(i?)[xj is an L-ideal. Hence L (£)[*] C L(R[x]) = L[x). Thus L(R) C L. 

3. Infinite sets of indeterminates. Let R[x a \ be the ring of all poly- 
nomials in a set of a indeterminates {x<} where a is any cardinal number. 
A simple induction procedure, or a proof similar to that of Lemma 2J, yields 

Theorem 4 (a). L(R[x a ]) = iV a [3c a ] where N a = L(L[x a ]) C\ R is a nil ideal 
and iV a 3 s<r(R) for all /3 < a. 

(b) L(RM) = L(i?)[* a ]. 
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Furthermore, we have 

Theorem 5. Let a be an infinite cardinal, then 7(i?[.r«]) = JV a |aea] is the 
maximal nil ideal of i?[x a ] and N a — Np for all 0 > a. If R is an algebra over 
an infinite field, then N a — LBI(R). 

Let Xi be an indeterminate of the set {xj} and let {*/} denote the rest of 
the indeterminates. Since a is not finite, the sets {Xj} and {#/} have the same 
cardinal number. Hence = iV*^] and /(/?[*,]) = #«[*<]. Clearly 

= 7?' [jci] where R' = 72[x/]. It follows now by Theorem 1, that 

7(i?'[3d]) = N'[xi], N' = R' nj(R'[ Xl ]). 

Since /(i?'[;ci]) = /(i?[3c<]) = iV*^], it follows that N' = iV a [a:/]. By Theorem 
1 it follows that N' is a nil ideal. Since and {x/} are of the same cardinal 
number, one obtains iVafr/] = #«[**]. Consequently, iV*^] is a nil ideal; 
thus /(i?[*a]) is a nil ideal and, therefore, it is the maximal nil ideal of i?[* a ]. 

Let be a set of indeterminates of cardinality a > iV 0 and let {yfi 
be a finite set of new indeterminates. Since the cardinality of the set {x t } 
and is a, we have y } \) = iV a [x*, yfi, /(2?[ie<]) = iV a [Xi] where 

N a = R C\ J(i?[x f ]) = R r\ J(R[x i} yfi). By the preceding proof it follows 
that NcXxujj] is a nil ideal. Hence, the ring of all polynomials over iV a [x<] 
in any number (finite or non finite) of indeterminates is a nil ring. Clearly, 
the non finite case can be reduced to the finite case which has just been proved. 

Now let 0 > a and let {#<} be a set of indeterminates of cardinality a and 
{x it zf\ a set of indeterminates of cardinality 0. By the previous remark it 
follows that N a [Xi t Zj] is a nil ideal, hence N a [xt, Zj] C J(R[x it Zj]). On the 
other hand zd) = Nplxu zA: hence, N a C Na. Since 0 > a, it follows 

by Theorem 4 that Np C N a . Thus Np = N a . 

Let R be an algebra over an infinite field F and let a i, ... , a„ be a finite 
set of elements of iVo. Since a is an infinite ordinal, we have a finite set of 
indeterminates Xi , . . . , x H £ {#,{ and thus, a^x i + . . . + a n x n 6 iV a [ac a ]. 
It follows by the previous result that (apci + • • • + = 0 for some 

integer m. This immediately implies that the module generated by the set 
(a i, . . . , a n ) contains nil elements of index <m. Consequently, N a C LBI(R), 
and the fact that N a 2 LBI(R) completes the proof of the theorem. 

4. 7r-radicals. We follow in this section the notation of (1) and (2). 

The similarity between the proofs of Lemma 1 J , 2J and Lemmas 1L, 2L 
exhibits the generality of the methods used. The only place where the explicit 
definitions of the quasi-regularity and the L- property were involved was in 
proving that if f(x) 6 R P [x v — x] belongs to the radical considered of 
then it belongs also to the same type of radical of R p [x p — *]. The proof of 
this fact for the L-property uses only the fact that a subring of an L- ring is 
an L- ring. This condition for arbitrary properties it was denoted in (1) as 
( D ,). Thus we have: 
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Lemma 4. If x is an RZ-property satisfying (D,) then x(i?„[*]) D i? p [x p — x] 

Qtt(R p [x p — *]). 

The methcxl used in proving Lemmas 1L, 2L and Lemmas 1J, 2J, yields 
also 

Theorem l7r. If t is an RZ-property and R is an algebra over an infinite 
field or of characteristic zero , or x satisfies the condition that x (/?„[*]) C\ R p 
[x p — x] C tt(R p [x p — »]) then: x(i?[x]) ^ 0 implies that x (/?[*]) C\ R 0. 

Theorem 2x. If ir and R are the same as in the preceding Lemma , then 
7t(-R[3c]) = P[x ] where P = x (•/?[#]) C\ R. 

One readily verifies also, as in the proof of Theorem 3, that: 

Theorem 37t. If -ir and R are as above and if x satisfies the condition that a 
polynomial ring 5[x] over a x-ring S is also a ir-ring then x(7?[jc]) = x(i?)[jc]. 

Properties satisfying the conditions of Theorem lx are readily seen to be 
nillity, locally finiteness and locally nilpotency. The latter satisfies also 
Theorem 3x. 
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CONICS AND ORTHOGONAL PROJECTIVITIES 

IN A FINITE PLANE 


W. L. EDGE 

1. Introduction. The ternary orthogonal group of projectivities over a 
finite field leaves a non-singular conic x invariant, but the geometry conse- 
quent thereupon does not appear to have been investigated. The group is 
isomorphic to a binary group of fractional substitutions over the same field 
and this fact may, since these binary groups and their subgroups are so well 
known, have inhibited projects to embark on a detailed description of the 
geometry of the ternary group. While, however, one may concede that no new 
intrinsic knowledge of the group can be gained, different representations of 
the same abstract group are apt to portray some of its attributes from different 
aspects and to display in different settings interrelations among its properties; 
and if one recalls the situation in the real or complex field the incentive to 
initiate some investigation becomes compelling. 

The representation, over the real or complex field, of the points of a line X 
by those of a conic r is now commonplace and goes back at least as far as 
Hesse. The involutions of pairs of points, as well as harmonic sets, seem more 
appositely carried on r than on X. The Pascal property of T is simply, in 
essence, a statement about three involutions having a pair in common; but 
although these involutions can be carried on any rational curve, and the 
Pascal property interpreted in that context, it will be generally agreed, and 
not merely on historical grounds, that the conic is the most appropriate 
setting for it. The representation, too, of harmonic pairs on X as pairs on T 
whose joins are conjugate has its advantages, and no apology is needed for 
undertaking some account of the corresponding representation when the base 
field is neither the real nor the complex field but a Galois field. 

The paper falls into three sections. In the first (§§2-9) the foundations of 
the figure are laid and its fundamental properties established. It is explained 
how the points of the plane fall into 3 disjoint classes according as they are 
exterior to, on, or interior to x; this phenomenon is known (10), but we 
proceed to discuss the pairing, on various lines, of conjugate points. This 
pairing is basically relevant, and the description of it has to take account 
of whether or not —1 is a square in the base field. The number of canonical 
triangles — triangles, that is, in reference to which x is given by equating the 
unit quadratic form to zero — is calculated. 

The second section (§§10-17) introduces the orthogonal group of pro- 
jectivities and stresses the presence in it of involutions (of two kinds) and 
octahedral subgroups. The subgroup, of index 2, which subjects the points 
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of x to even permutations is the main focus of interest and a criterion is given 
for the octahedral subgroups to belong to it. They do, or do not, belong to it 
according as it permutes the canonical triangles intransitively or transitively. 
Other subgroups are found as the stabilisers of points in the plane. 

The third section (§§18-32) is devoted to a detailed description of the geo- 
metry when the base field is GF(p) and p — 5, 7, 11. For these values of p, 
but not for any higher values, the orthogonal group has a representation as a 
permutation group of degree p; such representations are found in the 3 planes. 
The geometry has many features of interest, such as the multiple perspec- 
tivities between certain pairs of canonical triangles and, when p = 11, the 
distribution of the points external to x i n se ts of 6, the 15 joins of points of 
such a set being all skew to x and concurrent in threes at 10 different points 
all internal to x- 


Conics and their Canonical Triangles 

2. The number q of marks in a finite field F is always a power of a prime p. 
Every non-zero mark satisfies x* -1 = 1, and there always occur primitive 
marks of which no power lower than the (q — l)th is 1. All the non-zero 
marks are powers of any primitive mark j. VVe suppose throughout that 
p > 2. Then j cannot be the square of any mark of F because, if j = i 2 , 

ji(Q-l) — i 9-1 — l t 

contradicting the primitiveness of j. Nor can any odd power of j be a square; 
it is impossible to extract a square root of any odd power of j without enlarging 
F. All even powers of j, on the other hand, are clearly squares of marks of F. 
The non-zero marks are thus half of them squares and the other half non- 
squares. 

The product and quotient of two non-squares are always squares. 

Take, as an example, q = p = 7 . We may label the marks 

-3, -2, -1,0, 1,2,3 

and regard them as the residue classes to modulus 7. The primitive marks are 
3 and —2. The squares are 

1 = 3 8 = ( — 2) 6 , —3 = 3 4 = (-2) 2 , 2 = 3 2 = ( — 2) 4 , 

while the non-squares are 

-1 = 3 3 = ( — 2) 3 , 3 = 3*= ( — 2) 6 , —2 = 3 5 = (-2)*. 

It is important, with a view to the geometry, to distinguish between fields 
wherein —1 is, or is not, a square. Since —1 is this power of j not being 

1 and yet a square root of 1, 

— 1 is a square whenever q = 1 (mod 4), 

— 1 is a non-square whenever q = — 1 (mod 4). 
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3 . The marks of F will serve as homogeneous coordinates of points and lines 
in a plane; each point or line answers to a vector of 3 components not all of 
which are 0. There are q 3 — 1 such vectors; but the q — 1 non-zero multiples 
of any given vector represent the same point, or line, so that the plane consists 
of 

(ff* -D/(?- 1) =q* + q+l 

points and of the same number of lines. When it is necessary to distinguish 
point and line the coordinates of a point may be written as a column vector 
and those of a line as a row vector. The number of points on a line and of 
lines through a point is 

(? ! - 1 )/(? - 1) = q + 1. 

4 . We take for granted ( 3 , p. 158) the fact that every non-singular conic 

can, by appropriate choice of the triangle of reference, be given by equating 
to zero the unit quadratic form x 2 + y 2 + z 2 . Let us, before embarking on the 
main task of exposition, enquire into the geometrical significance of this 
canonical form. It certainly refers the conic to a self-polar triangle, but there 
is more to say than this because a conic has self-polar triangles which, when 
used as triangle of reference, do not permit this canonical form unless F is 
enlarged. The complete explanation has to take account of whether or not — 1 

is a square in F. The line x = 0 meets the conic where y 2 + z 2 = 0; if — 1 is a 

square this yields two intersections, whereas if — 1 is not a square there are 
no intersections; and the like occurs on y = 0 and on z = 0. If we describe 
any triangle which permits the canonical form x 2 + y 2 + z 2 = 0 as a canonical 
triangle and denote it by A then 

if q = 1 (mod 4) the sides of any A are all chords of the conic, 

if q = — 1 (mod 4) the sides of any A are all skew to the conic. 

The number, q(q 2 — l)/24, of A is found below in §§7, 8. 

Note that there is, on any side of any A, a unique pair of points that is both 
harmonic to the vertices of A and conjugate for the conic; on x = 0 this pair 
is given by y 2 = z 2 , and that whether x = 0 is a chord or is skew to the conic. 
The three such pairs on the sides of a A are the vertices of a quadrilateral Q 
having A for its diagonal triangle; A and Q each determine the other uniquely. 
When A is the triangle of reference the sides of Q are 

x + y -f z = 0, —x -f- y + z = 0, x — y z — 0, x -\- y — z = 0. 

5. Let x denote the conic x 2 -f- y 2 -f- z 2 = 0. The polar of P{a, /3, 7 ) is 
ax + Py + 72 = 0, and passes through P if and only if P is on x". X is the 
aggregate of points that lie on their own polars. 

If the polar of P passes through P'{a , jS', 7 ') then aa + /3/3' + 77 ' = 0 
and the polar of P' passes through P\P,P' are then conjugate with respect to x- 
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Does the polar of P meet x? At least one coordinate of P, say y, is not zero; 
then, for points of x on the polar, 

Y 2 (* 2 + y 2 ) 4* (ax + 0y) 2 = 0, 

( 7 2 + a 2 )x 2 + 2 a&xy + ( 7 2 + 0 2 )y 2 = 0. 

The discriminant of this quadratic is 

« 2 0 2 — ( y 2 + a 2 ) ( 7 2 + 0 2 ) a — 7 2 (a 2 + 0 2 + 7 2 ), 

so that the quadratic has, or has not, roots in F according as —a 2 — 0 2 — 7 2 
= — 2a 2 is, or is not, a square. 

If —2a 2 is a square we call the polar a c-line or chord , and say that P is 
external to x- 

If 2a 2 = 0, P is on \ and the polar a /-line or tangent. It does not meet x 
elsewhere. 

If — 2a 2 is a non-square we call the polar an 5-line; it is skew to x> and P 
internal to x- 

This separation by a conic of the points of a plane into disjoint classes i6 
noted by Qvist (10, pp. 9 and 19) but he does not proceed further, save to 
remark on the numbers of tangents through the points. If a tangent passes 
through P then the polar of P passes through the “contact” of the tangent, 
and conversely; hence there pass 

two /-lines through any external point, 
one /-line through any point of x, 
no /-line through any internal point. 

We may call external points e-points, and internal points i-points . 

Every /-line consists of q + 1 points; one is the contact, but the remaining 
q have all to be e. It follows, on polarising, that there are q -f- 1 lines through 
any point of x» one line being the tangent and the remaining q all c. Hence, 
since q chords pass through any point of x» X consists of q + 1 points. Since 
the number of c-lines is \q(q -f 1) and of /-lines is q + 1, the number of 
5-lines is 

q 2 + q + \ - \q(q + 1) - (q + 1) = \q(q - 1), 

and this must also be the number of i-points. Thus x separates the q 2 + q + 1 
points of the plane into disjoint batches of 

\q(q +i). q + 1, \q(q - l) 

and likewise the q 2 -f- q + 1 lines into these numbers of c-lines, /-lines, 5-lines, 
respectively. 

Any two /-lines intersect, and the §<?(<? + 1) c-points are thus accounted for. 

6. Of the q — 1 points of a c-line, not on x> half are i and half are e. For let 
(ai, 0i, 7 i) and (a 2 , 02 , yi) be any two distinct points of x', any point on the 
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c-line which joins them is (ai + kaz, 0 i + fc 0 2 , 71 + kyz) where k runs through 
the q — 1 non-zero marks of F. This point is c or i according as 

— 2(ai + £<* 2 ) 2 — - 2kXa\az 

is, or is not, a square; it cannot be zero since the point is not on x- But when 
k runs through the q — 1 non-zero marks, —2k’Eaia 2 does likewise, and since, 
of these marks, § (q — 1 ) are squares and the others not, it follows that, 
of the q — 1 points of the c-line not on x. i(q ~ 1 ) are c and the others i. 
It follows too, on polarising, that through each e-point there pass, with 2 
/-lines, — 1) c-lines and \{q — 1) 5-lines. 

Since there are \{q — 1) z’-points on each of the \q(q + 1) c-lines there pass, 
through each «-point, 

\ q(q + i) • 5(2 - 1 )/k(ff - i) = £(? + i) 

c-lines, and so \{q + 1) 5 -lines too. Polarisation then discloses that, of the 
5+1 points on any 5-line, half are e and half i. 

7. Call two points, neither of them on x» similar if they are either both e 
or both i ; otherwise opposite. 

Consider the pairing, as conjugate to one another, of the q — 1 points on c 
that are not on x- Any conjugate pair is given by 

(«i ± kaz, /3i ± & 02 » 7 i ± kyz) 

for some non-zero k. Now the marks db2fe2aia 2 are both squares or both 
non-squares if — 1 is a square, whereas if — 1 is not a square one of the two 
marks is a square and the other not. Hence, for — 1 not a square, the con- 
jugates of the 5(5 — 1 ) c-points on c are the \{q — 1 ) r-points on c; conjugate 
points on c are opposite. If, however, —1 is a square conjugate points on c 
are similar; the \{q — 1 ) c-points consist of £(5 — 1 ) conjugate pairs, as do 
the |(g — 1 ) i-points. Let, —1 being a square, the pole of c be c 0 , and let e\ 
and e 2 be any one of the \(<q — 1 ) pairs of conjugate c-points on c; then each 
vertex of the triangle c 0 eie 2 is an c-point and the triangle, being self-polar for 
X, is a canonical triangle A. Since we may choose c, with its pole, in \q{q + 1) 
ways and, thereafter, take any of the \(q — 1 ) conjugate pairs of c-points on 
c the number of A is, since each of its 3 sides may be used to begin its con- 
struction, 

iff (2 + 1) • Mff “ !) • i = ?(? 2 - l)/24, 

and each c-point is a vertex of \{q — 1 ) of them. The lowest value of q for 
which — 1 is a square is 5; there are then 5 A whose 15 vertices account for the 
15 c-points just once. When q = 9 there are 30A, each of the 45c-points being 
a vertex of 2 of them. 

8. The relation between conjugate points on 5 can be deduced from that on c. 
Suppose that e x and c 2 , two external points on 5, are conjugate; their polars 
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ci and c 2 pass through e 2 and ei, respectively and meet at to, the pole of s. 
Hence conjugate points on c are opposite and —1 is not a square. But if e\ 
and i% are conjugate points on s their polars C\ and s 2 pass through t 2 and ei 
respectively and meet at i 0 ; hence conjugate points on c are similar and — 1 a 
square. It follows that, when —1 is a square, conjugate points on 5 are op- 
posite; the conjugates of the \(q -f 1) t-points are the \(q + 1) e-points. 
But if — I is not a square conjugate points on s are similar; there are \ (q + 1) 
conjugate pairs of i-points and + 1) °f e-points. 

When — 1 is not a square each A has 5-lines for its sides and t-points for its 
vertices. In order to construct a A we may choose any one of the \q {q — 1) 
5-lines as a side, and thereafter any of the \ {q + 1) pairs of conjugate i-points 
on it as vertices. Since the construction may set out from any of the 3 sides 
the number of A is 

\q{q - 1) . \{q + 1) . | = q(q 2 - l)/24, 

and each f-point is a vertex of £(<? + I) of them. The lowest value of q for 
which —1 is a non-square is 3; there is then a unique A and its vertices are 
the only i-points in the plane. When q = 7 there are 14 A, each of the 21 
i-points being a vertex of 2 of them; when q = 11 there are 55A, each of the 
55 i-points being a vertex of 3 of them. 

9 . Let ABC be any A and take e, distinct from B and C whether the vertices 
be e-points or f-points, on BC (there is no such e if q = 5). The /-lines through 
e are harmonic to eBC and eA, and harmonic inversions in vertices and op- 
posite sides of ABC yield a second pair of /-lines whose intersection is the 
harmonic conjugate of e in regard to B and C. These 4 /-lines form a quadri- 
lateral U with A BC as diagonal triangle; U is the same as Q if p = 3, though 
not otherwise. 

When the vertices of A are e the e-points on BC afford j(g — 5) pairs har- 
monic to B and C; A gives rise to }(<7 — 5) U, of which it is the diagonal 
triangle, whose sides account for all q T 1 /-lines save those 6 which pass 2 
through each of A, B, C. Every A provides such a partitioning of the /-lines. 
The lowest relevant values of q are 9 (when the U are also Q ) and 13. 

When the vertices of A are i the partitioning of /-lines is simpler; each A 
gives rise to 1(? + 1) U, the e-points on any 5 -line falling into £(9 + 1 ) 
pairs harmonic to the vertices of any A of which this 5 -line is a side; these 
e-points can be paired not only in the involution / 0 of pairs conjugate for x, 
but in £(<7 + 1) involutions I k each having the vertices on 5 of a A for foci. 
No two of these |(g + 5) involutions are the same, and 7 0 commutes with all 
the others since the foci of any of these form a pair of I 0 . When q — 7 the 
pairing of the 4 e-points is as follows: 

e it e'i and e 2 , e' 2 in / 0 ; 

ei, e 2 and e\, e' 2 in A, with foci i 1 , i\; 

ej, e' 2 and e\, e 2 in / 2 , with foci t 2 , i\. 


Copyrighted material 


368 


W. L. EDGE 


Here f 2 , i ' 2 must be a pair of 1 1 ; t'i, i'\ a pair of / 2 ; not only do h and I 2 com- 
mute with 1 0 , they commute with each other. Their product, in either order, 
is /o as is seen by observing the permutations imposed on the e when / 1 and / 2 
act in succession. 

The Ternary Orthogonal Group of Projectivities 

10. Suppose now that a projectivity leaves x invariant. It must permute 
the A among themselves, so that the sides x = 0, y = 0, z = 0 of any given 
A become the sides £ = 0, 17 = 0, f = 0 of (the same or) some other A. Here 
£, 17 , f are linearly independent linear forms in x, y , z; and since x admits both 
the equations 

X 2 + y 2 + z 2 = 0 , e + 17 2 + r 2 = 0, 

the left-hand side of either equation is a scalar multiple of the left-hand side 
of the other. Thus 


M SS 
*— < 

Y 

V 

= M 

X 

y 




_z_ 


where M is a three-rowed non-singular matrix whose elements are all in F, 
and 

x'x = X 2 + y 2 + z 2 = A (£ 2 + 77 2 -f t 2 ) = XH'H = Xx'M'Mx, 

so that 

10.1 XM'M = I, 

the unit matrix. Here X is a mark of F ; indeed it is a square because, on taking 
determinants in 10 . 1 , 

X 3 jM | 2 = 1. 

The projectivity is, however, unaffected if M is replaced by any scalar 
multiple of itself; if H = tw - 1 M with m either square root of X then, from 10.1, 

H'H = I. 

Then |H | 2 = 1, and we choose m to be that square root of X for which |H| 
is + 1 ; the projectivity is imposed by an orthogonal matrix of determinant 
+ 1 . Conversely: this matrix is uniquely determined. For the only matrices 
which impose the same projectivity as H imposes are those of the form «H 
with o) a non-zero mark of F\ the orthogonality condition demands that 
w 2 = 1 and the determinantal condition that co 3 = 1 , which together require 
w = 1. 

These projectivities, as likewise the unimodular orthogonal matrices that 
impose them, form a group Q(3, q ): the orthogonal group in 3 variables over 
F. 
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11. Note, in passing, the involutions in 8(3, q ), namely the harmonic inver- 
sions whose centre and axis are pole and polar for x* Since the matrix imposing 
such an involution satisfies H 2 = I as well as H'H = I it is symmetric as well 
as orthogonal. There are \q(q + 1) hyperbolic involutions whose centres are 
e-points and axes c-lines; the q — 1 points of x not on the axis are transposed 
in pairs. There are \q(q — 1) elliptic involutions whose centres are f-points 
and axes s-lines; the q -f 1 points of x are transposed in pairs. One of the two 
consecutive integers \{q ±. 1 ) is odd so that there are always in 8 ( 3 , q) involu- 
tions that impose odd permutations on the points of x — the hyperbolic ones 
if q = — 1 (mod 4), the elliptic ones if q = 1 (mod 4). 


12. The conditions, expressed by H'H = I, for 


H = 


ai a 2 a 3 
01 02 03 

jyi 72 73. 


with all its elements in F, to be orthogonal are 


12.1 2«! 2 = 2 a 2 2 = 2 a 3 2 = 1 , 

12.2 2a2«3 = 2a 3 ai = 2axQ!2 = 0. 


These conditions can be interpreted geometrically when each column of H 
is regarded as the coordinate vector of a point of the plane; 12.2 then demands 
that the 3 points form a self-polar triangle for x and 12.1 that their coordinate 
vectors be normalised. It is not possible to normalize any vector unless 2 a 2 
is a square; hence if —1 is a square internal points, and if —1 is a non-square 
external points, cannot have their coordinates normalised. But when it is 
possible to normalise a vector it admits two normalised forms ±(a, 0 , 7 ). 

If —1 is a square, each column of H is one of two normalised coordinate 
vectors of one of three mutually conjugate e-points; that is, the columns of 
H answer one to each vertex of a A. The vertices of A can be taken in any 
order and, with this order chosen, four of the eight combinations of sign are 
permitted by the stipulation that |H| = + 1 . Hence the number of such 
orthogonaL matrices is 

g(g8 2 ~ 1} . 3! . 4 = q(i - 1). 

If —1 is not a square, the calculation leads to the same result; it is governed 
by the columns representing vertices of a A and is not affected by these vertices 
being e or i. The order of the group 8(3, q) is q(q 2 — 1). 

The triangle of reference is itself a A, and the 24 matrices, obtained from I 
by imposing the 3! permutations on its columns and using the 4 choices of 
sign permitted for each permutation, form that subgroup of 8 ( 3 , q) for which 
the triangle of reference is invariant. It is an octahedral subgroup; indeed it 
acts as the symmetric group @ 4 on the sides of the Q associated with the 
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triangle of reference, imposing all 4! permutations on them. When q = 3 this 
©4 is the whole orthogonal group; otherwise it is one among q(q 2 — l)/24 
octahedral subgroups of 12(3, q). The involutions in © 4 are those three whose 
centres are vertices of A and those six whose centres are vertices of Q. The 
former answer to the diagonal matrices diag(l, —1, — 1), diag( — 1, 1, — 1), 
diag( — 1, —1,1) and the latter to the matrices 

[ —ioo“] r — i o o' 

0 0 1 , 0 0 -1 

0 1 oj L 0-1 0 . 

0 1 0“ 

10 0 , 

0 0 -1_ 

13. 12(3, q) acts as a permutation group on the q + 1 points of x‘- should 
any of its operations subject these points to an odd permutation precisely one 
half of them must do so, and those operations that impose even permutations 
will then form a normal subgroup of index 2. We have, however, already noted 
the presence, in every group 12(3, q), of involutions that impose odd permuta- 
tions; hence 12(3, q) has a normal subgroup 12 + (3, q) of order \ q(q 2 — 1). 
Those involutions that belong to 12 + (3, q) are the \q{q -f 1) hyperbolic involu- 
tions if q = l(mod 4), whereas they are the \q(q — 1) elliptic evolutions if 
q = — l(mod 4); in other words they are always those involutions whose 
centres are vertices of A. 

14. There is a criterion which decides whether the octahedral subgroups of 
12(3, q) are also contained in 12 + (3, q): they will not be so contained unless the 
vertices of Q are similar to those of A, for 12 + (3, q) only contains either hyper- 
bolic or elliptic involutions, never both. On the other hand, it does contain all 
the involutions of one of the two types. The test of — 2a 2 being, or not being, 
a square establishes that the vertices of A are similar to those of Q if, and only 
if, 2 is a square; for — 2a 2 is — 1 at the vertices of the triangle of reference and 
— 2 at those of its associated Q. When q is a prime p the similarity of the 
vertices requires that, in the common phraseology, 2 is a quadratic residue; 
this occurs ( 11 , p. 110) whenever p = ± l(mod 8), but not when p = ± 3 
(mod 8). 

It is clear from §12 that 12(3, q) permutes the A transitively; a given A is 
then invariant for 

?(s ! - 1) + = 24 

projectivities of 12(3, q) and they form one of the octahedral subgroups. 
But if 12 + (3,<z) permutes the A transitively only 12 of its operations leave a 
given A invariant; they form a subgroup of index 2 in an octahedral group — 
a tetrahedral group that imposes the 12 even permutations on the sides of 
the associated Q. Should, therefore, 12+(3, q) contain the octahedral subgroups 
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it cannot act transitively on the A, which must then fall, under 12+(3, q), into 
two transitive sets of q(q 2 — l)/48 each, sets which form two systems of 
imprimitivity for 12 ( 3, q) and which are transposed by any projectivity of 
12(3, q) that is outside 12+(3, q). 

15 . The rules (see §12) by which matrices of 12(3, q) are formed show that 
the group is transitive not only on the A but also on those points that can 
serve as vertices of A; hence any such point B, and its polar b, are latent for a 
subgroup 12 b , the stabiliser of B in 12(3, 3), of order 

q{q 2 “ 1) 4- %q(q ± 1) = 2 (q =F 1), 

the upper or lower sign occurring according as B is e or i. 12 B includes all the 
involutions whose centres are on b\ they account for q =F 1 of its operations 
and their matrices all have the coordinate vector of B latent, with multiplier 
— 1. The other q =F 1 operations of 12 B form the group 12 B + for which the 
coordinate vector of B is invariant, being associated with a latent root +1. 
12 is isomorphic to the binary orthogonal group which it induces on b, and 
we now show that it is cyclic. That it is abelian follows at once from the form 
of the 2-rowed orthogonal matrices of determinant 1 (3, p. 169); it can be 
asserted to be cyclic once the presence in it is detected of an operation whose 
period is the order of 12 B +. 

The binary orthogonal group consists of all matrices 

with m 2 + v 2 = 1 and both u, v belonging to GF(q). Since U 2 = 2«U — I it 
follows (cf. 11 , p. 368) that, if h is a square root of —1, 

2hv\J n = { (u -f hv) n — (w — — { (w + hv)*~ l — (u — /to)* -1 } I, 

and then that U n = I if, and only if, 

(u + hv) n = (u — hv) n = 1. 

Should B be an e then h 6 GF(q) and every U satisfies U* -1 = I; in order to 
find U with period q — 1 it is only necessary to choose u + hv, and therewith 
its reciprocal u — hv, to be a primitive mark. 

If, however, B is an i then u ± hv do not belong to GF(q) but to a quadratic 
extension GF(q 2 ); they are conjugate marks therein, each the gth power of the 
other. Hence 

(u + hv) Q+l = (« + hv)(u + hv ) 9 = (m -f hv)(u — hv) = 1, 

and every U satisfies U ff+1 = I. A matrix of period q + 1 is found by choosing 
u *4“ hv in GF(q 2 ) to be a primitive root of x 9+l = 1. The mark h, having served 
its purpose, falls out of the working and leaves only marks of GF(q) in the 
final result. 

Two examples may perhaps be given, with details of the calculations left 
out. 
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The quadratic X 2 = X + 1 is irreducible over GF{ 7) and the adjunction 
of either root f extends the field to GF(7 2 ). A primitive root of x 8 = 1 in GF( 7 2 ) 
is f 2 , and so we take 

U + = f 2 , « — f *v = $“ 2 , 

which give u = v = — 2; 

has period 8 over GF(7). 

The quadratic X 2 = AT — 1 is irreducible over GF{\ 1) and the adjunction 
of either root 0 extends the field to GF(11 2 ). A primitive root of x 12 = 1 in 
GjF( 11 2 ) is 20 + 2 and we take 

u + (40 - 2)» = 20 + 2, u — (40 — 2)0 = -20 + 4, 
which give u = 3, v = — 5; 

has period 12 over GF(ll). 

The operation of period 2 in 12 B+ is manifestly the involution with B as 
centre and this, since B is vertex of a A, belongs to 12+(3, q). But not all 
operations of 12 B+ so belong, and hence only half of them will do so. For 12 B 
certainly contains operations outside 12 + (3, q ), namely those centred at Hs'Tl) 
points on b opposite to vertices of A; and when all involutions centred on b 
are jettisoned from 12 B to leave 12 B+ only half these are outside 12 + (3, q) and so 
not all the q =F 1 operations of 12 B outside 12 + (3, q) are rejected. The conse- 
quence is that 12 + (3, q) has subgroups 12£ of order gT 1 and cyclic subgroups 
12 b + of order %{q =F 1), and is transitive on vertices of A. 

It is perhaps not superfluous to remark that, as the involution centred at 
B imposes the identity projectivity on the points of b, the groups of pro- 
jectivities on b are of orders one half those of the groups of orthogonal mat- 
rices; 12 b+ imposes |((Z =F 1) projectivities on b and is in (2, 1) homomorphism 
with this latter group. 




16 . There is a corresponding discussion for points D, opposite to vertices 
of A, and their polars d\ 12(3, q) has subgroups 12 D of order 2(q ± 1) and cyclic 
subgroups 12 D+ of order q ± 1; 12 + (3, q) has subgroups 12© of order q ± 1 and 
cyclic subgroups 12£ + of order \{q db 1). There is a difference in that the 
“restriction” of the ternary quadratic form to the line d does not have the 
canonical form £{ + £*» that it had on the side of a A, but + vijl where v 
is any fixed non-square of GF(q). When the coordinates are transformed to 
correspond thereto the binary orthogonal projectivities answer to matrices 
(3, p. 161) 

u=r -'I. 

[_ — w uj 

wherein u 2 + vv 2 = 1; but these also satisfy U 2 = 2 mU — I. 
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17 . The group ft+(3, q) is (3, p. 164) isomorphic to the linear fractional 
group LF( 2, q ), and the subgroups that have just been obtained in the ortho- 
gonal representation were found in the linear fractional representation by 
Serret for the case q = p ( 11 . pp. 375, 379, 380; results which were given also 
in the earlier editions of this treatise) and by Dickson for q a power of a prime 
(3, pp. 263-4). The 3 involutions centred at the vertices of any A are mutually 
commutative and form, with the identity, a 4-group; the orthogonal repre- 
sentation thus discloses the q(q 2 — l)/24 4-groups in ft+(3, q) at a glance. 
They are, of course, well known in the linear fractional representation (3, 
p. 268; 1, p. 444). Serret also obtained the p 2 involutions of the group of linear 
fractional transformations, pointing out ( 11 , p. 382) that the number in 
LF{ 2, p ) is \p(p + 1) or \p{p — 1) according as p = 1 or — 1 (mod 4). 

The Details of the Geometry over the Smaller Fields 

18 . The remaining sections of the paper are given to describing the geometry 
for the smaller fields q = 3, 5, 7, 11. The figure for q — 3 has been described 
elsewhere ( 4 ); the A and Q therein are unique, and the sides of Q are the 
tangents at the 4 points of x- &(3, 3) is the octahedral, ft + (3, 3) the tetrahedral, 
group and the points of x and sides of Q undergo the corresponding permuta- 
tions. The subgroups Q B , one for each vertex of A, are the dihedral subgroups 
of order 8; fi B + the cyclic subgroups of order 4. There is only a single ft B , 
namely the 4-group that is a common subgroup of the 3 dihedral ft B , but there 
are 3 cyclic groups S1 B+ of order 2. 

19 . Some description has also been printed ( 5 ) of the figure for q = 5, 
although in quite a different context and using a different nomenclature. 
An account of this figure from the standpoint of the present enquiry is therefore 
given now. Each of the 15 c-lines is a side of one, and only one, A; and since 
no vertex e of any A lies on x the c-lines through a point of x belong one 
to each of the 5A. The 6 points of x are separated by the sides of any A into 
3 pairs of a syntheme (i.e., 3 pairs which together account for all 6 points) and 
the 5 synthemes, one arising from each A, constitute a synthematic total T 
(i.e, 5 synthemes which together account, by 3 pairs in each, for all 15 pairs of 
the 6 points). 

Since there are 3 c-lines through each i-point the points of x fall, in 10 dis- 
tinct ways, into 3 pairs which, since their joins are concurrent, are in involution 
on x- Each involution yields a syntheme, and the 10 synthemes so arising are 
those extraneous to T. We may say, with Clebsch, that the points of x form a 
hexagon endowed 10 times over with the Brianchon property. 

Clebsch (2, p. 336) establishes the existence of such hexagons in the real 
projective plane; their vertices are not then on a conic, neither will they be 
when we encounter such hexagons again below with q = 11. They arose when 
Clebsch mapped his ‘diagonal’ cubic surface on the plane, the surface itself 
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having arisen by making certain transformations of a quintic equation. The 
presence of such hexagons in the real plane is however visually obvious: 
they are provided by sections of the 6 diagonals of a regular icosahedron 
in Euclidean 3-space, the Brianchon points being the sections of the 10 joins 
of centroids of pairs of opposite faces. Some approach, whether deliberate or 
not, to this aspect of the matter is made by Klein (8, p. 218) but he does not 
appear to record that mere section is enough to provide the figure. The 
simplest section, by a plane perpendicular to a diagonal of the icosahedron, 
gives a hexagon consisting of the 5 vertices and the centre of a regular 
pentagon. 


20 . Take now any two A; call them Ai and A 2 . Label the points oi \ A, 
B', C, A', B , C so that 

BC’, CA ', AB' are sides of Ai, 

B'C , C'A , A'B are sides of A 2 . 

Since no e-point can lie on sides of more than one A, BC and B'C meet at 
a point i\\ the remaining c through i\ is AA' . Hence 

AA' , BC , B'C are concurrent at i\, 

BB', CA' , C'A are concurrent at t 2 , 

CC, AB', A'B are concurrent at i 3 \ 

moreover i\, i%, i 3 lie on the Pascal line 5 0 of the hexagon AB'CA'BC. Thus 
Ai and A 2 are in fourfold perspective. Since they are both self-polar for x« 
any axis of perspective is the polar of the corresponding centre of perspective; 
the four axes of perspective are AA', BB' , CC, s 0 . The first three of these are 
concurrent at t 0 , the pole of 5 0 ; they are the c-lines which pass through i 0 and 
are sides one of each of the A other than Ai and A 2 . Every pair of the 5A is 
in this relation of fourfold perspectivity, and each of the 10 5-lines plays the 
role of So for one pair of A The 5-lines are sides of Q, and each pair of Q share 
one 5-line, namely the Pascal line of the hexagon of which their diagonal tri- 
angles provide alternate sides. 


21 . 12(3, 5) is of order 120 and subjects the 5A to all 5! permutations; the 
octahedral subgroup for which one A is invariant subjects the other 4A to 
all 4! permutations. 12 + (3, 5) is of order 60 and subjects the 5A to all even 
permutations; it has no octahedral subgroups and permutes the A transitively. 
The coordinate vectors of the 3 vertices of any A can therefore be displayed as 
columns of a matrix of 12 + (3, 5); for instance thus: 
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Indeed these matrices are symmetric and so, apart from I, represent involu- 
tions; the latent column vectors associated with the latent root +1 are found 
to be e, and so the involutions are hyperbolic and belong to 12+ (3, 5). 

Each matrix affords, by the 3! permutations of its columns and the 4 per- 
missible signings for each permutation, 24 matrices of 12(3, 5). If the permu- 
tations are restricted to be even, the 60 matrices so arising constitute 12+ (3, 5) ; 
for the 12 arising from I form the tetrahedral subgroup of 12+(3, 5) for which 
the triangle of reference is invariant and these, when they postmultiply the 
other matrices of 21.1 impose even permutations on their columns. 

The subgroups 12 are easily disposed of. If B is y = z = 0 the 8 matrices of 
12 B are 



1 0 0 
0-1 0 
0 0 - 1 . 

-1 0 0 
0-10 
0 0 1 


"l ool HO 0] f-1 0 
, 0 01 , 00 - 1 , 01 
.0 -1 0 J |_0 1 0 J L 0 0 

-1 0 0 ] [-1 0 ol 

, 0 0 1 , 0 0 -1 , 

0 1 oj L 0-1 oj 


0 

0 , 
-1 


the first 4 of which constitute 12 B+ , the first 2 12^+. The 4 diagonal matrices 
constitute 12£, and the same subgroup 12g arises for the 3 vertices of any A. 
These are the 5 4-groups in 12+(3, 5). The 15 cyclic subgroups 12^ + of order 2 
are of course generated by the hyperbolic involutions. 


22 . The presence of axes of perspectivity is an immediate consequence of 
the factorisations of linear combinations of products of sides of any two A. 
For instance: the first and fifth of the matrices 21.1 provide, over GF( 5), the 
identities 

{2x -f- y + z) (x + 2y 4 z) (x 4 y 4 2 z) + xyz 

S3 (2x — y — z)(2y — z — x)(2z — x — y) 

(2x 4- y + z) (jc + 2y + z) (x + y + 2z) — xyz 

= 2(x 4- y 4 z)(x 2 -f y* 4- z 2 ), 

and 9 other pairs of identities arise from these by applying the orthogonal 
transformations. 


23 . Suppose now that q = 7. All A and Q have t-points for their vertices, 
but whereas the sides of A are s those of Q are c. 12 + (3, 7) has 14 octahedral 
subgroups, each acting as the symmetric group of 4! permutations of the sides 
of a Q. 

Consider, for the moment, some one A. There are operations of 12+ (3, 7) 
that permute its vertices cyclically; they leave one side c of the associated 
Q invariant while cyclically permuting the 3 t-points thereon. They cannot, 
being of odd period, transpose the two points of xonc; nor can they, as not 
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imposing the identity projectivity on c, have any other latent point on c 
than these two; hence the 3 e-points on c are also permuted cyclically. Thus, 
on any c-line, the 3 f-points form an equianharmonic tetrad with either point 
of x> as do likewise the 3 e-points. The two triads are analogous to a binary 
cubic and its cubic covariant, with their common Hessian pair. 

0 

24 . The stabiliser Qt of a given i-point, called for the moment B, in 12+ (3, 7) 
is of order 8 . There are 2 canonical triangles A, A' with B for vertex and these 
are never transposed by any operation of 12^. A is invariant for an octahedral 
subgroup of 24 operations of 12 + (3, 7), and of these 8 leave B unaltered and so 
exhaust the stabiliser. The two octahedral subgroups associated with A and 
A' have this (dihedral) stabiliser in common. It follows that any operation of 
12(3, 7) that transposes A and A' lies outside 12+(3, 7). Any two A which share 
a vertex belong to different imprimitive systems. 

It is then easy, starting from any one A (say the triangle of reference A 0 ), 
to obtain all the A and partition them into two sets of 7. There are 3, say 
Ai, A 2 , A 3 , which share a vertex with A 0 and so belong to the opposite set; 
each of them shares a vertex with two A other than Ao, and the 6 A so arising 
all belong to the same set as A 0 and, indeed, complete it. We now display all 
14 A; each of the two horizontal strata consists of a set of 7 that are permuted 
transitively by 12+(3, 7). Each stratum is an imprimitive system for 12(3, 7); 
whereas both strata are invariant for 12 + (3, 7), they are transposed by those 
operations of 12(3, 7) that lie outside 12+ (3, 7). 
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2 0 2 
-3 2 3 

-10 0 

2 0 2 

-2 2 0 

2 -3 -3 

0 2 2 

0-1 0 

2 2 0 

-3 2 3 

0 2-2 

2 0-2 

0 0-1 

-3 3 2 
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3 

-3 

-2 
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2 

-3 

3 

-2 

3 

3 

-2 

3 

2 

3 

-2 

-2 

0 

2 

-2 

0 

2 

-3 

3 

2 

3 

-3 

2 

3 

3 

-3 

2 

-3 

3 

2 

-3 

3 

2 

3 

3 

-3 

2 

-3 

-3 

2 

3 

3 

2 


Each square block provides, by permutations and signings of its columns, 24 
matrices of 12(3, 7); all 336 operations of the group are thus accounted for. 
The upper stratum provides, from its 7 blocks, the 168 matrices of 12 + (3, 7); 
the unit matrix provides those 24 matrices for which the triangle of reference 
is invariant. The other octahedral subgroups occur when these 24 matrices are 
transformed, in the sense HMH -1 , by those of the other 13 blocks. 
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25 . Each block is symmetric and therefore the matrix of an involution save 
when it is the unit matrix. The involutions of fi + (3, 7) are the 21 elliptic ones; 
of these 9 are provided by the matrices given in §12. There are 12 others, of 
which 6 are furnished by the blocks precisely as displayed; the outstanding 6 
are got from these by changing the signs of those 4 marks that occur in the 
same row or column as the zero in the diagonal, these changes neither altering 
the value of the determinant nor destroying the orthogonality. The 28 hyper- 
bolic involutions must be imposed by symmetric matrices, orthogonal and of 
determinant -f-l, whose columns, either themselves or their negatives, occur 
in the lower stratum. All the assemblages 


2 2 

2 -2 

-2 2 

-2 -2 

2 -2 

-2 -2 

2 2 

-2 2 


can play the part of that one which appears in any of its first three blocks; 
this accounts for 12 hyperbolic involutions. As for the remaining four blocks, 
not only does each provide a hyperbolic involution as it stands but it provides 
three others — by transposing any two of its three columns and multiplying 
by — 1 either the untransposed column only or all three columns, whichever 
alternative is the one to restore symmetry to the matrix. 

26. It is easy to give the explicit forms for the matrices of the stabiliser 
when B is y = z = 0. There are 16 of them; those 8 of Qb+ have +1 at 
their top left-hand corner, zeros elsewhere in the top row and left-hand 
column, and the residual block one of 

1 0 -1 0 0 1 0 -1 2 -2 -2 2 2 2 -2 -2 
0 1 0 -1 -1 0 1 0 2 2 -2 -2 -2 2 2 - 2 . 

The other 8 matrices have — 1 in the top left-hand corner, and the residual 
blocks are the 8 two-rowed blocks just given but each with its bottom row 
changed in sign throughout. The 8 matrices with only 0, 1, —1 for their 
elements constitute here, in contrast to q = 3, 5, the subgroups differ 
for different vertices of the same A. 

The 4 e-points on x = 0, being those points for which 

y — 2z, y = — 2z, 2 y = z, —2 y = z, 

undergo a cyclic group of 4 permutations under when B is y = z = 0. 
Thus fi + (3, 7), transitive on the 4 c-lines through an t-point as well as on the 
21 t-points, is transitive on the 28 c-lines, and the stabiliser of a given c-line 
in ft + ( 3, 7) is of order 6. Now any c-line is a side of 2 Q, each invariant for an 
octahedral subgroup of fl+(3, 7) imposing the 4! permutations on its sides; 
there are 3! operations of this subgroup for which c is invariant and which 
impose the 3! permutations on the remaining sides. These 3! operations 
exhaust the stabiliser: any projectivity of f2(3, 7) that transposes 2 Q that share 
a side must lie outside fl + ( 3, 7). The 14 Q fall, with their diagonal triangles, 
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into 2 imprimitive systems of 7, and 2 Q with a common side always belong to 
opposite systems. 

27. There is a symmetric (3, 3) correspondence between A in opposite 
systems; two A, one of each system, correspond when they share a vertex. 
There is also a symmetric (4, 4) correspondence between A in opposite systems; 
two A, one of each system, correspond when the Q associated with them share 
a side. This latter is of course the same as the correspondence between A 
of opposite systems that do not share a vertex; such A are in a certain geo- 
metrical relation that will now be obtained. 

Let c be one side of a Q\ i\, iz, iz the vertices of ^ on c; i\, i ' 2 , i' 3 its opposite 
vertices. Each pair of opposite vertices is conjugate for x- The diagonal 
triangle A is ji jz jz where j\ is common to iz i' 3 and i'z iz, and so on. The polars 
jx i'ujz i'z, j 3 i'z of ii, iz, iz are concurrent at e, the pole of c\ the intersection 
e\ of c and j\ i'\ is the point on c conjugate to i\, and likewise for ez and ez. 
The 5-lines through j 1 join it to the i on jzjz and so meet c at i\, iz, iz, e\. 
Thus j 1 ez and j\ ez are c-lines as, likewise, are jz ez, jz Ci, j z Ci, jz e 2 , and the 
only c-lines through, say, e\ are e\jz, e\jz, iiiziz- 

Take, now, the other Q of which c is a side; it also has i\, iz, iz for vertices 
but has another diagonal triangle kikzkz ; and the only c-lines through e\ are 
eikz, e\kz, iiiziz • This implies, since ki, like j 1, is on ee\, that jijzjz and kikzkz 
are in perspective from e\. Similarly they are in perspective from ez and C3. 
And they are manifestly in perspective from e. 

Two A, in opposite systems and not sharing a vertex, are therefore in 
quadruple perspective. Their centres of perspective are all e-points and 
the polar of one of them contains the other three and is the commop side of 
the two Q associated with these A. 

Just as for q = 5, so for q = 7; the axes of perspectivity of two A can be 
displayed as factors of linear combinations of the products of their sides. 
The simplest such identities occur when one A is the triangle of reference 
and the other, to be selected from the lower stratum but not to be any of the 
first three blocks therein, is the A answering to the last block, since the product 
of its three sides is a symmetric function of the coordinates. The identities, 
over GF( 7), are 

(2x + Sy + 3z) (Sx -f- 2y -f- 3z) (3x -j- 3y -f- 2z) — xyz 

= 2(2x — y — z) (2y — z — x)(2z — x — y) , 

(2x + 3y -f- 3z) (3x + 2y 4- 3z) (3x 4- 3y -f- 2z) -f- xyz 

= — 3(x + y + z) (x 2 -f y 2 + z 2 + yz -f zx + xy). 

The centres of perspectivity of these two triangles are then (1, 1, 1) and the 
three e-points 

(2, -1,-1), (-1,2, -1), (-1, -1,2) 

on its polar. The Q associated with these A are that whose sides are 
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x + y + z = 0, y + z — x = 0, z + x — y = 0, x -j- y — z = 0, 
and that whose sides are 

x + y + z = 0, y + z + 2x = 0, z + x + 2y = 0, x + y + 2z = 0. 

Each of the 28 c-lines is a common side of two Q, whose diagonal triangles are 
in quadruple perspective in the manner described above; there are 28 pairs of 
identities of which the pair displayed is one, and the other 27 pairs are derivable 
from this one pair by applying the orthogonal transformations. 

28. The symmetrical (3, 3) correspondence between two sets of 7 objects 
must occur in any representation of Q + ( 3, 7). For Klein’s representation as a 
group of ternary substitutions over the complex field there occur (9a, p. 715; 
7, p. 443) two sets of 7 conics; every conic of either set meets Klein’s non- 
singular plane quartic in the 8 contacts of 4 bitangents, and the 7 sets of 4 
bitangents answering to the 7 conics of either set account for all 28 bitangents. 
Each bitangent belongs to one, and only one, quadruple of either set (9a, 
p. 712) and a symmetrical correspondence between the two sets of 7 conics 
is set up if conics, one in each set, correspond when the quadruples do not have 
a bitangent in common. 

But perhaps the representation of Q + (3, 7) that most simply displays the 
(3, 3) correspondence (although the two sets do not now consist of like objects) 
is the group of 168 projectivities of the 7-point plane «, a point and line corres- 
ponding when they are incident. In w each of the 7 lines contains 3 points and 
each of the 7 points lies on 3 lines. 

29. It was announced by Galois (6, p. 412) that LF( 2, q) has a permutation 
representation of degree q for q = 5, 7, 11; this is never so if q > 11. The 
isomorphic group 0*^(3, q) must therefore also admit such a representation 
in the finite plane; one has already been encountered for q = 5, 7, when the 
q objects permuted are canonical triangles: for q = 5 the whole set, for q = 7 
the members of either imprimitive system. And so the question is clamant: 
what geometrical entities supply a representation of 12+ (3, 11) as a permuta- 
tion group of degree 11? 

In the finite plane corresponding to q = 11 there are, as we shall see, 
Clebsch hexagons; hexagons, that is, endowed in 10 ways with the Brianchon 
property of concurrence of 3 diagonals. Given the conic x there are 22 Clebsch 
hexagons ^ all of whose vertices are e-points and diagonals 5-lines; each of 
the 66 e-points is a vertex of 2 ^ that belong one to each of 2 imprimitive 
systems of 11 ^ Either system supplies a representation of S2 + (3, 11) as a 
permutation group of degree 11. The operations of fi(3, 11) that are outside 
fl + (3, 11) transpose the 2 systems. 

30. Take x» x 2 + y 2 4- z 2 = 0, and the triangle of reference A. Suppose 
that an 5-line meets both y = 0 and z = 0 in points e neither of which is a 
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vertex of Q\ such points are (c, 0, 1) and ( b , 1,0) with b, c both marks of 
G.F(11) and both b 2 and c 2 neither 0 nor 1. Moreover, the points being e, 
neither b 2 + 1 nor c 2 + 1 can be a square. Since, in GE(ll), 

1, 4,-2, 5, 3 are the squares, 

and —1, —4, 2, —5, —3 the non-squares, 

b 2 and c 2 can only be —2 or 5. But the join 

x = by + cz 

is not an 5-line unless b 2 + c 2 + 1 is a square; this prevents b 2 -f- c 2 from 
being —4 or —1 and forces it to be 3; b 2 and c 2 are, in either order, the two 
marks —2 and 5, squares of ±3 and ±f. This yields 2 quadrangles whose 4 
vertices are e and 6 joins s, two of the joins being y = 0 and 2 = 0; one has 
vertices (±3,1,0) and (1,0, ±3); the other has vertices (1, ±3,0) and 
(±3,0,1). 

Consider now the first of these quadrangles. Through each vertex pass, in 
addition to its joins to the other vertices, 2 further 5-lines; the 8 5-lines so 
arising are found to meet 4 at each of 2 e-points on x — 0 and these 8 5-lines, 
with x = 0 and the 6 joins of the quadrangle, are the 15 joins of 6 e-points, 
namely of 

3 10-310 

30.1 1 0 3 1 0 -3 

0 3 10-31. 

Verification is immediate. And the 15 5-lines are sides of the following 5A: 
A 0 : xyz — 0 

Ai: ( 5x — 4y — 2z)( — 2x + 5y — 4c) ( — 4* — 2y + 5z) = 0 

30.2 A 2 : ( 5x + 4y + 2z) ( — 2x — by + 42) { — 4x + 2y — 52) = 0 

A 3 : ( — 5x — 4y -f 2z) ( 2x + 5y + 42) ( 4x — 2y — 5 z) = 0 

A 4 : ( — 5x -f* 4y — 2z) ( 2x — 5y — 4 2 ) ( 4x ± 2y + 52 ) = 0. 

Denote by the hexagon whose vertices are 30.1. Each A^ in 30.2 answers 
to a syntheme of vertices of » ; the 5 synthemes, one for each A jt constitute a 
synthematic total T. Each of the 10 synthemes extraneous to T provides 3 
pairs whose joins concur: ^ has the Clebsch property. The concurrencies are 
all at points i, and normalized coordinate vectors for them are 

0530 -5 3 -2 222 

30.3 30530 -5 2 -2 22 

5 3 0 -5 3 0 2 2 -2 2. 

Each of these points is the concurrence of sides of 3 of the 5 A ; ; and each of 
these 3 sides is an axis of perspective of the 2 remaining A^. The triple per- 
spectivity of A 0 and Ax accords with the identity, over GE(ll), 

(5* — 4y — 22 ) ( — 2x + by — 42) ( — 4* — 2y + 5 2 ) + xyz 

= 4(5* + 4y + 22 ) (2x + 5y -j- 4z) (4x + 2y + 52 ). 
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Other identities derived from this by imposing the orthogonal transformations 
exhibit other pairs of canonical triangles in triple perspective. 

One may, in passing, note the relation between the Q associated with such 
triangles; they are found to have a common side, the line on which the 3 
centres of perspective of their diagonal triangles lie, and their 2 sets of 3 
vertices thereon account for all e-points on the side. 


31. The elliptic involution centred at a vertex of A 0 leaves A 0 invariant while 
transposing the other A j as two pairs; the analogous situation holds for the 
involution centred at any vertex of any A } , and the A } thus undergo the 15 
even permutations of period 2 of the alternating group Els. The 15 involutions 
all belong to J2 + (3, 11) and generate a subgroup thereof; this is icosahedral, 
being isomorphic to 21s because any projectivity which imposes the identity 
permutation of the A } must impose it on the points 30.3 (each of which is 
determined by those 3 A^ whose sides intersect there) and so be the identity 
projectivity. 

^ is not invariant for the whole group S2 + ( 3, 11), it is changed to other 
hexagons by the involutions centred at the points 30.3; the subgroup for which 
it is invariant is thus a maximal icosahedral subgroup of order 60, and ^ is 
one of 

660 ^ 60 = 11 

Clebsch hexagons permuted transitively by fi+( 3, 11). The other 10 are 
obtained at once by imposing the involutions centred at the points 30.3; 
taking, for example, the last of these points x = y = z we have 


[ 


-4 -3 -3 
-3 -4 -3 
-3 -3 -4 


3 1 
1 0 
0 3 



The vertices of the 11 ^account for all 66 e-points, and the A } which belong 
5 to each account for all 55A. 


32. The 11 ^only provide one half of the figure; there is a second set of 11 
Clebsch hexagons ^ equally well supplying a permutation representation. 
These are obtained by starting, instead of from 30.1, from 

1301-30 

3 0 1 -3 0 1 

01301-3 

which affords, by a synthematic total of its vertices, the 5 A got by transposing 
y and z throughout 30.2. This transposition is effected by using the involution 
whose matrix is 

-1 0 ol 

0 0 1 ; 

.0 1 0 I 
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its centre (0, 1, 1) is an e-point so that it is hyperbolic and, although belonging 
to 12(3, 11), it does not belong to ft + (3, 11). Thus Q( 3, 11) is transitive on 22 
Clebsch hexagons ^ and these form imprimitive systems for 12 (3, 11) and 

each set of 11 is a transitive set for Q+(3, 11). 

Each e-point is a vertex of a single ^ and a single there is a symmetrical 
(6, 6) correspondence between the and ^ wherein corresponding hexagons 
share a vertex. Alternatively, one may use the symmetrical (5, 5) corres- 
pondence wherein corresponding hexagons do not have a vertex in common. 
These correspondences between 2 sets of 11 objects will occur in other re- 
presentations of ft + (3, 11). Klein, in 1879, found a representation as a group 
of quinary linear substitutions over the complex field, and when the 5 variables 
on which the substitutions operate are used as homogeneous coordinates in 
[4] there do occur two sets of 11 quadrics with each quadric of either set 
t linearly dependent on 5 of the other (9b, p. 429). 


References 

1. W. Burnside, Theory of groups of finite order (Cambridge, 1911). 

2. A. Clebsch, Ober die Anwendung der quadratischen Substitution auf die Gleichung Blen 

Grades und die geometrische Theorie des ebenen Fiinfseits, Math. Ann., 4 (1871), 284-345. 

3. L. E. Dickson, Linear groups, with an exposition of the Galois field theory (Leipzig, 1901). 

4. W. L. Edge, Geometry in three dimensions over GF{ 3), Proc. Royal Soc. A222 (1953), 

262-286. 

5. , 81-point geometry, Math. Gazette, 89 (1955), 113-121. 

6. E. Galois, Lettre de Galois & M. Auguste Chevalier, J. de math, pures et appliqu&s, 11 

(1846), 408-415. 

7. F. Klein, Vber die Transformation siebenter Ordnung der elliptischen Funktionen, Math 

Ann., U (1879), 428-471. 

8. , Vorlesungen fiber des Ikosaeder (Leipzig, 1884). 

9a, b. F. Klein and R. Fricke, Vorlesungen fiber die Theorie der elliptischen Modulfunktionen 
(Leipzig, 1890 and 1892). 

10. B. Qvist, Some remarks concerning curves of the second degree in a finite plane. Annales 

Academiae Scientiarum Fennicae A., No. 134 (1952). 

11. J. A. Serret, Cours d'alglbre supSrieure (5th ed.), tome 2 (Paris, 1885). 

University of Edinburgh 


Copyrighted material 


CONVEX HULLS OF SIMPLE SPACE CURVES 


DOUGLAS DERRY 

1. Introduction. The convex hull of an arbitrary set M in real Euclidean 
n-space is known to consist of all the points within the r-simplexes with r + 1 
vertices from M, r < n. This note shows that if M is specialized to be a curve 
A n of real order n, then its convex hull consists of all the points within the 
r-simplexes with r + 1 vertices on A n , n = 2r + 1 or n = 2r. In the first 
case each interior point is within exactly one simplex. This result was given by 
Egerv&ry (1) for n = 3. If n is even each interior point of the convex hull 
of A n is within a 1-parameter system of ^»-simplexes. The class of curves A n 
includes the twisted w-ics, the convex hulls of which have been studied by 
Karlin and Shapley (2). Some of their results are consequences of the present 
results. 

2. Some definitions. A curve A„ is defined to be a 1-1 continuous mapping 
in real Euclidean «-space of all the real numbers s computed modulo 1 or of 
the interval, 0 < 5 < 1, which satisfies the order condition that no hyperplane 
contains more than n points of A n . 

The order condition implies that any linear fc-space, 0 < k < n, cannot 
contain more than k + 1 points of A n . If a hyperplane H supports A n at an 
inner point s' then s' is defined to have multiplicity two within H. By displacing 
the hyperplanes it is possible to show that the sharpened order condition 1 
holds that no hyperplane contains more than n points of A n if each point is 
counted with its proper multiplicity of one or two. 

The symbol [A,B,...\ denotes the intersection of all the linear spaces 
which include the point sets A, B, . . . , while {A, B, . . .) denotes the convex 
hull of the union of the point sets A, B, . . . . Two sets A and B are said to be 
separated by a hyperplane H provided A is in one of the closed half spaces 
bounded by H and B in the other. 

3. The boundary of A n . The following lemma is stated without proof. 

Lemma 1. If a hyper plane H supports a compact set X , then {iJr'l.X'} = 
HC\ {X}. 

Theorem 1. The boundary of {^4 n } consists of all the points within all the 
q-simplexes for which the vertices are q + 1 points of A„ including e endpoints , 
2q < n — 2 + e, {e = 0, 1, 2). 

Received October 14, 1955. 

>1 should like to thank the referee for the improvements he suggested and in particular for 
pointing out that the above form of the sharpened order condition, which makes no use of 
differentiability, was sufficient for the results of this paper. 
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Proof. If P be a boundary point of \A n \ , a hyperplane H exists which 
supports A n and contains P. Let So. $1, . . . , s q be the distinct curve points in 
H r\ {i4 n }. Because of the order condition, {so, $i, . . . , $«} is a g-simplex. 
By Lemma 1, 

P € H r\ \A n \ = {H A A n \ = {so, 5i, . . . , 

As H supports A n an interior point of A n must be included in H twice. 
Consequently if e denotes the number of endpoints of A n in H, it follows 
from the order condition that 

e -f- 2(g + 1 - e) < n or 2g < w - 2 + e. 

Thus each boundary point of {>!„} is within a 5-simplex, with the required 
properties. 

Conversely let P be a point of a g-simplex {s 0 , $1, . . . , for which 2q < n — 
2 -f Then a hyperplane exists which contains P and supports A n . To 
construct such a hyperplane, for each point s t interior to A n , let N t be an arc 
s' t <s<s t and if A„ is not closed let N', N" be neighbourhoods of the end- 
points 0, 1 respectively. Let H be a hyperplane which contains n points of A n 
including all s t , s'i and so that the remaining n — 2 (q + 1) + e curve points 
within H are distributed among the arcs N u N\ N " in such a way that no 
arc 2V< contains an odd number of these points. This distribution is always 
possible because if A n is closed n is even and e = 0. If Ni—*s u N' —* 0, 
N " — * 1 then any limiting position of H contains P and supports A n . As 
P 6 { Sot s s ff } C [A n ], P is a boundary point of {^4 n } . The proof is 
now complete. 

4. The structure of f^4 n }* If 2r = n or 2r -f- 1 = n, S T is defined to be 
an r-simplex with interior points of A n as vertices except for even n when at 
most one of the vertices may be an endpoint of A n . 

Theorem 2. The interior points P of {^4 n } consist of all the interior points 
of the simplexes S T . 

For odd n, S T is uniquely determined by any one of its interior points P; for 
even n, S r is uniquely determined by an interior point P and any one vertex which 
can be either endpoint of A „ or any arbitrary point of A n if it is closed. 

Proof. We show first that every interior point P of a simplex S T is an in- 
terior point of {^4 n }. As S T C {A n \ it will be sufficient to show P is not a 
boundary point of f ■/!„}. Let e be the number of vertices of S r which are 
endpoints of A n . If P were a boundary point of {A n } it would be within a 
hyperplane H which would support {A n }. H would also support S r and con- 
sequently, as P is an inner point of S T , S T Q H. Therefore IT would contain 
2(r + 1 — e) + e points of A n . This would contradict the order condition 
as, by the definition of S r , e = 0 if n = 2r -f- 1 and e < 1 if n = 2r. Hence 
the inner points of the simplexes S r are all inner points of [A n \. 
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We next show that a given interior point P of [A n ] is an interior point of a 
simplex S T . Let a be any real number if A n is closed and 0 if A n is open. Denote 
by A (a, s') the arc of points s, a < 5 < s'. Let s P be the least upper bound 
of all s' for which P $ {A (a, P)}. 

We prove that P 6 {A(a, s P )}. If this were false, P and { A{a , s P )} would 
be separated by a hyperplane at a positive distance from [A (a, 5 P )}. This 
hyperplane would also separate A {a, s') and P for s' > s P provided s' were 
sufficiently close to s P . Consequently P $ [A {a, s')} contrary to the choice 
of s P . 

P is on a supporting hyperplane of [A {a, 5 P )}. To prove this let s M be an 
increasing sequence which converges to s P . Because P $ [A (a, Sy)} a hyper- 
plane Hy exists which supports {A(a, s M )| and contains P. Sy can be chosen 
so that Hy converges. If H be its limit then P £ H and H supports {A (a, Sy ) } . 
But, as Sy is arbitrary, H supports {A (a, s P )}. From this result, together with 
the fact that P £ {A ( a,s P ) } , it follows that P is a boundary point of [A ( a,s P ) } . 

Consequently, by Theorem 1, a simplex S 9 exists which contains P, has 
vertices on A ( a , s P ) and for which 2q < n — 2 + e, where e is the number of 
vertices of S 9 which are endpoints of A (a, s P ). The vertices of S t are also on 
A n . Let e' be the number of these vertices which are endpoints of A n . As P 
is not a boundary point of {A n \, 2q > n — 2 -f- e' . Therefore e' < e and so 

0 < e. If A n is open, e' = e — 1 as 0 is a common endpoint of A n and A (a, s P ). 

The two inequalities yield the result 2q = n — 2 -f- e. Hence, if n = 2 r, 
then e — 2 and q = r and, if n = 2r + 1, e = 1 and q = r. If A n is closed n 

is even and e' = 0. In this case the inequalities show e = 2 and r = q. 

P cannot be a point of a face of S T for such points, by Theorem 1, are bound- 
ary points of {A r \. Therefore P is an interior point of the r-simplex S T which 
satisfies the requirements of the theorem as e' = 0 for odd n and e' < 1 for 
even n. This completes the proof of the first part of the theorem. 

For even n, e = 2 and consequently a is a vertex of S T . If A n is closed a is 
arbitrary and so in this case, for a given P, an S T exists with an arbitrary 
vertex. If A n is open a — 0. After a reversal of orientation of the points on the 
curve, the other endpoint of A„ can be represented by the number 0. Therefore 
S r can be chosen so that either endpoint of A n is a vertex provided n is even. 

Suppose now P is a point within two distinct simplexes with vertices 
So, Si, , s T ; s' o, s' i, . . . , s' T and that P is not in a face of {s 0 , Si, . . . , s r }. 
Let k, 0 < k < r, be the number of vertices common to both simplexes. 
It follows, with the use of the Steinitz replacement theorem, that the space 

[so, s i, . . • , S T , So, Si, ...,s r\ 

has dimension at most 2r — k. It contains 2(r + 1) — k points of A n . This 
leads to a contradiction of the order condition unless 2r — k = n in which 
case k = 0 and n = 2r. This proves, for odd n, that P is within only one 
simplex S T and, for even n, that P is never in more than one simplex S T with a 
given vertex. The proof is now complete. 
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Corollary. Every point P in the interior of {A 2t } is an interior point of 
each of two suitably chosen simplexes S r , S' T which have no common vertex. 

Proof. If Air is open each interior point P of {A 2r } is, by the Theorem, 
interior to a simplex S r ( S ' T ) with the endpoint 5 = 0, (s = 1) as a vertex. 
If S T , S' T were to have a common vertex then, by the Theorem, they would be 
identical and both endpoints of A 2r would be vertices in contradiction to the 
definition of the simplexes. If Air is closed the result is clear. 

Lemma 2. If the vertices of two r-simplexes S r , S' T which have no common vertex 
are all on A 2t and if an arc of A 2t exists which contains two vertices of S r and no 
vertex of S' T , then S r , S' T have no point in common. 

Proof. Let s 0 , $i, . . . , s T , s 0 < si < . . . < s r < s 0 + 1 ( = s r+ i) be the 
vertices of S r . By the hypothesis an arc s k < 5 < s*+i exists which contains 
no vertex of S' r , 0 < k < r, if A 2r is open and 0 < k < r, if A 2t is closed. 
In the latter case the coordinates may be adjusted so that 0 < k < r. As 

S r , S'r have no common vertex, distinct curve points t'\, t\ t' T , t T of A 2t 

exist so that 


t'i ^ So ^ t\ < t'i ^ s 2 ^ t 2 < . . . < t' k +\ ^ s k < Sjt+i ^ /*+i < . . . 

< t' T <S r <t T <t' i+l 


and so that none of the arcs t\ < s < t u 1 < i < r, contains a vertex of S' T - 
Let H be the hyperplane [t\, t\, ... , t' r , t T \. As H intersects A 2t only in the 2r 
points t'i, t u 1 < i < r, all the points of the arcs /'< < 5 < t t , 1 < i < r, 
are either on H or on the same side of H while all the points of A 2r not within 
the above arcs are on the opposite side of H. Thus H separates the vertices 
of S r from those of S' r • Furthermore all the vertices of S' r are at a positive 
distance from H. Hence S T and S' T have no points in common. The Lemma is 
now proved. 

Convex hulls are defined for affine space. The following result shows that the 
convex hull [A 2r } can be defined in terms of projective concepts. 


Theorem 3. If So, s i, ... , s r ; s' o, s' i, ..., s' T are curve points of A 2t for 
which 


0 < So < s' 0 < *1 < • • • < s T < s'r < 1, 

for open A 2t and 

So < s'o < Si < . . . < S T < s'r < S 0 + 1 (= S r+ i) 
for closed A 2t then the interipr of {A 2r } consists of all the intersections 

[Sq, Sj, • • « , Sf] [s o, S x, . . . , S r]. 


Proof. Let P be a given point in the interior of {A 2t }. By the Corollary 
to Theorem 2, simplexes S r , S' r exist, without a common vertex, both of which 
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contain P as an interior point. Let Jo, Ji, . . . , J r , 0 ^ < • • • < s T < 

So -f" 1 be the vertices of S T . As S T , S' T have the common interior point P 
it follows from Lemma 2 that each arc j< < s < s t +i, 0 < i < r, contains 
exactly one vertex of S' T . Therefore if j'o, s' i, . . . , j' r be the vertices of S' T 
the subscripts may be adjusted so that, for closed A 2r , 

Jo < s'o < Si < . . . < j' r _i < j r < s' T < So + 1 

and, for open A 2rt either 

0 < Jo < j'o < Ji < • . . < Jr < s', < 1 


or 

0 < j'o < Jo < • • • < j'r < Jr < 1. 

As P is a common point of the simplexes 

P 6 [jo* Jl» • • • i Jr] ^ [j Oi J 1, . • • , J r]. 

Now let Q be any point of [jo, Ji, . . . , J r ] C\ [j'o, j'i, . . . , j' r ] where 

JOi Jl> • • • » Jr> J Oi J li • • • i J r 

are points of A 2r which satisfy the inequality system. The r-spaces [jo, Ji, 
. . . , j r ], [j'o, j'i, . . . , j' r ] must have at least one point in common as 2r = n. 
They cannot have more than one point in common for then 

[jo, Ji, • • . , Jr, J Oi • • • i J r] 

would have dimension at most 2r — 1 and contain 2r -f- 2 points of A 2r , in 
contradiction to the order condition. 

Q cannot be a point on a proper face of either simplex { j 0 , Ji, . . . , J r }» 
[j'o, j'i, . . . , j'r}. Suppose, for example, Q to be within the face [jo, Ji, . . . , 
Jr— i } . Then the space 

[JO. Jl» • • • » Jr— 1> J Oi ■ • m J r] 

would have dimension at most 2r — 1 and contain 2r + 1 points of A 2r in 
contradiction to the order condition. 

If Jo, Jj, . . . , J r , j'o, . . . , j' r move continuously so that the inequalities are 
always satisfied, Q is uniquely defined and moves continuously. We know, if 
Q = P, that Q is interior to {^4 2r } as well as to both simplexes { Jo, Ji, . . . , s T ] , 
[j'o, j'i, . . . , j'r}. As Q cannot enter a proper face of either of these simplexes 
it must remain in the interior of both of them. Q cannot enter the boundary of 
{A 2r }. For otherwise it would be in a hyperplane H supporting {A 2r } and 
consequently supporting { j 0 , Ji, . . . , J f }. As Q is an interior point of the simp- 
lex, [j 0 , Ji, . . . , J r ] ^ H. It follows from the inequality system that at most one 
vertex of { Jo, Ji, . . . , J r } is an endpoint of A 2r . Hence H would contain at 
least 2(r -f- 1) — 1 = 2r + 1 points of A 2t in contradiction to the order 
condition. Therefore Q must always remain in the interior of {A 2r }. The proof 
is now complete. 
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CONSTRUCTIONS IN HYPERBOLIC GEOMETRY 


FRANS HANDEST 

Introduction. In hyperbolic geometry we have three compasses, namely 
an ordinary compass for drawing ordinary circles with a given centre and a 
given radius, a hypercompass for drawing hypercycles with a given axis and 
a given radius, and a horocompass for drawing horocycles with a given dia- 
meter and passing through a given point. 

NestoroviS has proved that everything that can be constructed by means 
of one of the compasses and a ruler, can be constructed by means of either of 
the other compasses and a ruler (6; 7; 8; 9). Another important result we want 
to use in the following is a theorem by Schur concerning ruler constructions. 
Schur proved that even if we are only able to perform constructions in a 
finite part ft' of the projective plane, we are also able to carry out constructions 
in the entire plane. A point is then said to be constructed if it is determined 
as the intersection between two lines in ft'. A line is said to be constructed if 
there are constructed two points on the line ( 11 , pp. 15-22; see also 13 ). 
Another theorem we shall use is: To a given right-angled triangle {a, b, c, 
A,B) (i.e., a right-angled triangle with hypotenuse c, catheti a and b, and 
opposite angles A and B) there corresponds a second right-angled triangle 1 
{A(^7r — A ), a, A (B), n(c), — IT (6)} and using the same transformation 

on this triangle we obtain a third right-angled triangle and so on. Triangle 
number six is identical with triangle number one. This sequence of five tri- 
angles is called the Engel Chain (5, pp. 40-41). 

In this paper, we consider the following instruments: parallel-ruler, ruler, 
compass with fixed adjustment, and hypercompass with fixed adjustment. 

1. The parallel-ruler. A parallel-ruler is, as in Euclidean geometry, an 
instrument for drawing a line through a given point and parallel to a given 
line. We shall also, as in Euclidean geometry, use the parallel-ruler as an ord- 
inary ruler. 

Theorem 1. Any construction in hyperbolic geometry that can be performed 
by means of a ruler and any of the three compasses , can be performed by means of 
a parallel-ruler. 

Let the hyperbolic plane be the interior of the “absolute” conic ft situated 
in the real projective plane. If U and V' are two points 2 on ft determined by 

^he angle ir(p) is the angle of parallelism for the segment of length p. If A = ir(p) then 
P = A (A). 

2 In the following, U and V(Ui, Fi, U', V' and so on) will always be points on fi. If a line 
intersects Q at U (or Ui,U\, . . .) then its other end is called F (or Fi, V \, . . .), unless otherwise 
indicated. 
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the lines u and u' , we can always, by means of the parallel-ruler, draw the lines 
A V ' and BU where A and B are two arbitrary points on u and u', respectively, 
neither of them being the point of intersection u-u'. The points U and V 
are now determined by pairs of lines. This means, according to the result of 
Schur, that we are able to join two points on ft and to perform ruler con- 
structions in the entire projective plane, operating only inside a finite part 
ft' of the hyperbolic plane. Of course we have to choose ft' so that it contains 
parts of the lines determining the points on ft. Consequently it is possible 
to make the following constructions: 

1.01. Given a segment OA on one arm of an angle V'OV, construct OB = OA 
where B is on the other arm of the angle. 

Draw UU' and VV. Through their intersection draw a line through A. 
It will meet U'V' at B. 

Proof. UOA V 77 U'OB V' and since U —*■ U' and V — > V' , the perspectivity 
is a congruent transformation that takes OA to OB. 

1.02. Given a segment OA, construct C on the line OA so that OA — OC 
(A ^ C). 

Draw any line JJ\V\ (j^OA) through A. Draw U\0 and ViO and call their 
other ends (i.e., intersections with ft) U\ and V\, respectively. Then U\V\ 
intersects OA at C, and OA = OC. 

1.03 Given an angle V'OV, construct its internal bisector. 

Construct A and B on OV and OV r , respectively, so that OA = OB (1.01). 
A V' and B V will intersect at a point of the angle bisector. 

1.04 Given a line UV and a point P not on the line, construct the perpendicular 
line to UV through P. 

Bisect the angle UPV (1.03). The angle bisector is perpendicular to UV. 

1.05 Given an angle VOV' , construct an angle VOV" = 2 • VOV'. 

Take a point P on OV and construct the symmetric point P' to P with 
respect to OV' (1.04 and 1.02). Then VOP ' = 2 • VOV'. 

1.06 Given a line l and a point P on l, construct a line n perpendicular to l 
through P. 

Draw any ray (not contained in /) beginning at P, and double the angle 
around l (1.05). Construct the internal bisector n of the supplement of the 
double angle. 

1.07 Given a segment AB — p, construct II (p). 

Construct the perpendicular line / to AB at A (1.06) and draw a parallel line 
to l through B. 
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1.08 Given a segment AB and a point A' f both on a line UV, construct B' on 
UV such that AB = A'B'. 

Take any line UiVi UV) through A. If U 2 is the other end of U\B and 
V 2 is the other end of V\A' then B' is UV - U 2 V 2 . 

Proof. Use Andrianov’s theorem (1; for a more elegant proof see 2): Let 
the four sides of a quadruply asymptotic crossed quadrangle meet an arbitrary 
transversal in points A, C, B, D; then BC and DA are congruent segments. 

1.09 Given a segment AB and a ray V starting at A', construct B' on l', such 
that AB = A'B'. 

Construct B\ on AA' such that AB = AB\ (1.01) and the point B 2 on AA’ 
such that ABi — A'B 2 (1.08), and finally the point B' on V such that 
A'B 2 = A'B ' (1.01). 

1.10 Given a segment AB, construct the mid-point M. 

U A iA and BB\ are equal and both perpendicular to AB at A and B, 
respectively (1.06 and 1.09), and A \ and B\ are on opposite sides of AB, then 
A\B\ intersects AB at M. 

1.11 Given two segments a and c (a < c), construct a right-angled triangle 
with hypotenuse c and cathetus a. 

Construct II (c) (1.07) and use II (c) and a to construct the second triangle 
of the Engel chain (cathetus a fixed) (1.09 and 1.06), so as to obtain B as the 
angle of parallelism of the hypotenuse (1.07). Construct then the right-angled 
triangle containing this angle B and the adjacent cathetus a (1.09 and 1.06) 
The hypotenuse is c and the required triangle is constructed. 

1.12 Given a point O, a segment r — AB and a line l intersecting the circle 
0{r), construct the points of intersection. 

Construct OO x perpendicular to l (with O x on 1) (1.04) and the right-angled 
triangle with hypotenuse AB and cathetus 00\ (1.11). The other cathetus 
OiC can now be moved to l (1.09). Ci and C 2 (where 0\C — 0\C\ = OiC 2 ) are 
the intersections. 

1.13 Given two points 0 1 and 0 2 and two segments of length r 1 and r 2 , construct 
the intersections of the circles Oi(r x ) and 0 2 (r 2 ). 

Let d denote the distance 0\0 2 , and b the distance from O x to the intersection 
of 0\0 2 and the radical axis; then 

, , cosh r x cosh d — cosh r 2 

tanh b = . 

cosh r 1 sinh d 

The segment b can be constructed in the following way : Construct 0\Of = r x 
and 0 2 0 2 = r 2 , both perpendicular to 0\0 2 (1.06 and 1.09), with Of and 0 2 
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on the same side of 0i0 2 . Construct the mid-point M of 0/0 / (1.10) and 
construct the line m perpendicular to 0/0/ at M (1.06). Let m meet 0\0 2 at 
A ; then A0 2 = b. 

Proof. 0/A = AO/ . \i A0 2 = x, so that OiA = d — x, then 

, cosh r\ cosh d — cosh r 2 

tanh x = ; . 

cosh r i sinh d 

Since tanh is a single-valued function, we have x = b and 1.13 reduces to 

1.12 ( 8 ). 

Any construction that can be performed by means of a compass and a ruler 
can then be performed by means of a parallel-ruler, and this result, along with 
the theorem of NestoroviC, proves Theorem 1. 

2. Analogues of Steiner’s construction 

Theorem 2. Any construction that can be performed by means of any of the 
three compasses and ruler , can be carried out with the ruler alone if there is drawn 
somewhere in the plane (i) a circle with its centre and two parallel lines, or (ii) a 
hypercycle with its axis and two parallel lines with their common end not on the 
axis, or (iii) a horocycle with one diameter and two parallel lines with their 
common end not at the centre of the horocycle (12). 

(i) Let £2 again be the absolute conic, co the given circle with centre A, and 
P the common end of the two given parallel lines. We want to prove that if 0 
is any given ordinary point and l is any given line, we are able to construct 
the parallels from O to l. When this is proved, Theorem 1 will give us Theorem 
2(i). Let £2' be a finite part of the hyperbolic plane containing co, 0, a part of 
l, and a part of the two lines that define P. By means of two harmonic con- 
structions, we can obtain the polar a of A with respect to co. This is also the 
absolute polar of A (i.e., the polar with respect to £2). The construction can be 
carried out by using the ruler only inside £2'. Join P and A and let Q be one of its 
intersections with co. The homology H, with axis a, centre A, taking Q to P, 
will take co to £2 (4, pp. 173-174). 7f -1 will then take £2 to co. 

Construct now the images O' and V of 0 and l in the homology H~ l . Join O' 
to the intersections, Pi and P 2 , of co and V , and construct the images of 0'P\ 
and 0'P 2 in the homology 77. These lines are the parallel lines desired. 

(ii) . Given a hypercycle co, with axis a, and two parallel lines with end P 
(P not on a), we can again use two harmonic constructions to obtain the pole 
A of a with respect to co. The constructions can be carried out by using the 
ruler inside a suitable finite part £2' of the hyperbolic plane. The point A is also 
the absolute pole of a. Let ^4P intersect co at Q, as before. The homology 77, 
with axis a, centre A, taking Q to P, will take co to £2. Using the same principle 
as above, we are able to construct a line through a given point parallel to a 
given line. This proves Theorem 2 (ii). 
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(iii). In the third case, where we have a horocycle a> with centre A, the 
homology is an elation. But here, the centre A is not a given point. To deter- 
mine A, we have to construct a second diameter of the horocycle. This can be 
done as follows: Let B be the ordinary end-point of the given diameter d , 
and let F be any other given point on co (neither B nor A). Choose on the 
conic three distinct points C, D , E, none of them coincident with B or F, and 
let l be the join of the intersections d-DE and BC-EF. Join F to the inter- 
section l -CD. Since this is a line passing through A, it is a diameter. For, l is 
the Pascal line of the hexagon ABCDEF. 

The centre A is now determined by two parallel lines. The tangent a to co 
can be constructed as the Pascal line of the hexagon A ABODE. This is also 
the tangent at A to SL All the above constructions can be performed by ruler 
inside a suitable finite part of the hyperbolic plane. The elation, with centre 
A and axis a, taking Q to P (where P is the given end and Q an intersection 
of AP and w), plays now the same role as the homology H in (i) and (ii). 

As shown by Obl&th in connection with Steiner constructions (10; for a 
more elegant proof see 3), it is sufficient if we are given only an arc, however 
small, of the circle, hypercycle, or horocycle. Hiittemann’s proof, being 
projective, is valid here. 

3. Compasses with fixed adjustment 

Theorem 3. Every construction that can be performed by any one of the three 
compasses and ruler can be performed by either (i) a compass with fixed adjust- 
ment and a ruler or (ii) a hypercompass with fixed adjustment and a ruler. 

If we can prove that by means of our instruments we are able to construct a 
pair of parallel lines, then Theorem 3 will follow from Theorem 2. 

(i) Draw a circle co with centre A and a diameter l. Construct (by means of 
two harmonic constructions) the pole L of l with respect to co. This is also the 
absolute pole of /. Given a point P either on l or outside /, PL is then per- 
pendicular to l. All the constructions can be performed inside a suitable part 
Q! of the hyperbolic plane. 

The usual parallel construction can now be carried out, taking the arbitrary 
radius to be the radius given by the adjustment. 

(ii) . Perpendicular lines can be constructed in the same way as in (i), using 
a hypercycle instead of a circle. 

Two parallel lines can be constructed in the following way: Draw an acute 
angle AOB and construct on OA a point A i so that OAi is equal to the adjust- 
ment of the hypercompass. Construct l perpendicular to OB at 0 and /i 
perpendicular to l through Ai. Let the hypercycle with axis OB intersect l\ 
at S. The line m perpendicular to OB through 5 is parallel to OA. As a matter 
of fact, this is only the usual parallel construction here performed by a hyper- 
compass instead of the ordinary compass. 
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4. The common perpendicular to two skew lines. Finally we wish 
construct (e.g., by means of ruler and compass) the common perpendicular 
to two skew lines in hyperbolic 3-space. 

Let the given lines be g and g'. Take an arbitrary point A on g and con- 
struct the two lines AEi and AE 2 , where Ei and E 2 are the ends of g'. On g, 
construct points Mi, M 2 , such that M f Ei is parallel to AE t and perpendicular 
to g(i = 1 or 2). If M is the mid-point of M\M 2 and MN is perpendicular to 
g', then MN is the required common perpendicular. 

Proof. Project the whole figure on the plane Ng. If the projections of E\ 
and E 2 are Fi and F 2 , respectively, then MM\F\N = MM 2 F 2 N and therefore 
Z MiMN = Z M t MN = \tt. 
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LOCAL CONNECTEDNESS OF EXTENSION SPACES 


BERNHARD BANASCHEWSKI 

1. Introduction. An extension E* * of a topological space E (that is, a space 
containing £ as a dense subspace) 1 determines a family of filters @(m) on E, 
given by the traces U C\ E of the neighbourhoods U C E* of each u 6 E*—E. 
Many topological properties of an extension E* of a given space E can be 
related to properties of these trace filters (as we shall call them) belonging 
to E*. In this respect, the following condition for filters 21 has proved to be of 
some interest: 

(C) If OUP 6 21, 0 and P disjoint open sets, then either 0 6 21 or P £ 21. 

If, for instance, the trace filters of a locally connected extension E* of a 
simply connected space E fulfil (C), then E* is also simply connected (2). 
This statement involves previous knowledge of the local connectedness of E*. 
In the present note, a simple characterisation in terms of trace filters will be 
given for the local connectedness of extension spaces whose trace filters satisfy 
condition (C). This will then enable us to show that certain types of extensions, 
amongst them the Cech compactification of locally compact spaces which are 
denumerable at infinity, can never be locally connected. 

2. The principal result. A filter 21 on a topological space E is called open, 
if it has a basis consisting of open sets. Open filters for which condition (C) 
holds we shall call connected. As one can readily see, condition (C) for open 
filters is an extension of the concept of connectedness from open sets to open 
filters. If E* be an extension of E, to any open set 0 C E let 0 be the set of 
all points u 6 E* — E which satisfy 0 € ©(«), <S(w) being the trace filter 
belonging to m. OU 0 is open in E*. There is at least one open 0* C E* for 
which O = O* C\ E. Obviously, one has O* C O 0 , hence it follows that 
each x 6 0 is an interior point of 0 U 0 (in E*). Furthermore, to each u € 0 
there exists, by definition, an open neighbourhood V in E* for which V C\ E C 
O, and again one has V C O \J 0 . 

The passage from 0 to 0 U 0 will be used as the main device in proving 
the following proposition: 

Let E* be an extension of E each of whose trace filters is connected. Then E* 
is locally connected if and only if E is locally connected and each trace filter has a 
basis consisting of connected open sets. 

Received January 4, 1956. 

*A11 topological concepts are used in the sense of Bourbaki (3). All spaces considered here are 
assumed to be separated ( = Hausdorff) spaces. 
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If £ is locally connected and the trace filters ©(w) have the stated property, 
it is quite obvious that E* is locally connected: To each neighbourhood W 
of u £ E*, there is a connected open set (in E) V C W E in ©(«) or in the 
neighbourhood filter 33 («) of u in E if u £ E, and therefore an open neighbour- 
hood U of u in E* for which U r\ E = V and U C W. As V is dense in U , 
U is also connected. 

Now, let E* be locally connected and U any open set (in E) from ©(a). 
Then there exists a connected open neighbourhood (in E*) FC U U of u 
(as U\J U is open, hence a neighbourhood of u ) and W = V E C U 
belongs to ©(a). The set W 'U W contains V and, apart from that, only 
adherence points of V ; therefore, the connectedness of V implies that of 
W\J W. Supposing there were a decomposition W = OKJ P of W into disjoint 
open sets 0 and P in E. Then it would follow that W = 0 KJ P, because, by 
hypothesis, 0 U P ( ©(») implies 0 6 ©(*>) or P £ ©(i>) for any v 6 E* — E. 
This would, however, mean that (O 0) (P KJ P) is a decomposition of 
W\JW into open disjoint sets, in contradiction to the connectedness of 
W \J W. Conseqently, W is a connected set, and as W C JJ and U was 
arbitrary, this shows ©(a) has a basis consisting of connected open sets. 
The same argument applied to the neighbourhood filter 33 (x) of each x 6 E 
(instead of the ©(a)) proves that each 33 (x) also has a basis consisting of 
connected open sets, or, in other words, that E is locally connected. 

As we have proved recently (2) the maximal open, the maximal regular 
and the maximal completely regular filters of a space E are connected filters. 
It is well known that the non-convergent filters in each of these categories 
form the set of trace filters of certain extensions of E: the maximal open 
filters correspond (4) to Katetov’s maximal Hausdorff extension kE of E\ 
the maximal regular filters, in the case of a regular E, to Alexandroff’s (1) 
extension aE\ the maximal completely regular filters, in the case of a com- 
pletely regular E, to Cech’s (T, 2, ch. IX, 1, ex. 7) compactification /3E of E. 
As a corollary to the above proposition, we therefore have: 

If E is not locally connected , then kE, a'E, and 0E are not locally connected 
either. 

3. Application to particular types of spaces. We now want to prove a 
similar statement for /3£, a'E and kE in the case of certain types of spaces E 
which include locally connected spaces as well as others. 

Let E be completely regular and suppose there exist denumerably many 
open sets O t C E whose closures are mutually disjoint and have a closed 
union. Then (3E is not locally connected. In each 0 ( one can find a descending 
sequence O i>k {k = 0, 1, 2, ... ; O li0 = 01) of open sets such that for each 
pair O iik , O t ' k+ i there exists a continuous function h iik on E, for which 

0 < h ifk (x) < 1, x € E\ 

hi. k {x) = 1 on and h i<k {x ) = 0 outside O itk . Now the sets 
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Af,« UO..i 

constitute the basis of a completely regular filter 9?: one has M i+ 1 C M t . 
Furthermore, the function 

{ hs % 6 Ot,u s ^ i 
0 otherwise 

is continuous, since 

Mi = U 0,. { 

S>i 

(by hypothesis concerning the O t ) vanishes outside M t , is equal to 1 on M i+ 1 , 
and assumes only values between 0 and 1. This shows the filter 92 is completely 
regular. 

Now, if /S£ were locally connected, each of the corresponding trace filters 
(that is, each maximal completely regular filter) would have a basis of con- 
nected open sets as proved above. Let 9)? 3 92 be maximal completely regular. 
By Zorn’s lemma, or similarly, owing to the compactness of 0E, there exist 
such 9)?. Then, as Mo € 972, there would exist a connected open set G C M 0 
in 972, and as M 0 is the union of the disjoint open sets 0 t , one would have 
GCO r for a certain r. This, however, would entail G C\ M r+ i = #, in con- 
tradiction to 92 C 972, which proves 0E is not locally connected. 

In exactly the same way, one obtains the following similar proposition: 
If a regular space E contains denumerably many open sets O t whose closures 
are mutually disjoint and have a closed union , then the extension a'E of E is not 
locally connected. Here one has to construct a regular filter 92 from sequences 
Ot.k, 0 t .o = 0 t , for which 0<,*+i C 0 iik holds, and then the proof proceeds as 
above. 

Finally, the same method gives this result: If a space E contains denumerably 
many disjoint open sets 0 U then the extension kE is not locally connected. In this 
case, one need only consider the open filter generated by the sets 

U O it (n = 0, 1, 2, . . .) 

i>n 

instead of the filters 92 above. 

A class of spaces satisfying the hypothesis required for E in the preceding 
arguments are the locally compact spaces which are denumerable at infinity. 
A space E of this type is the union of an ascending sequence M t (i — 0, 1, 

2, . . .) of open, relatively compact sets for which C M i+ \. Then_Af 1+ i — M t 
are disjoint open sets and any collection of open sets O t satisfying O t C M t+ i — 
Mi will have the desired property: for any x € U one has x € M, for some 
s and hence x 6 O,. This means is closed. 

We have, therefore, the following corollary: For locally compact spaces E 
denumerable at infinity, none of the extensions (3E, a'E and kE is locally connected. 
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MAXIMAL FLOW THROUGH A NETWORK 


L. R. FORD, Jr. and D. R. FULKERSON 

Introduction. The problem discussed in this paper was formulated by 
T. Harris as follows: 

“Consider a rail network connecting two cities by way of a number of 
intermediate cities, where each link of the network has a number assigned to 
it representing its capacity. Assuming a steady state condition, find a maximal 
flow from one given city to the other.” 

While this can be set up as a linear programming problem with as many 
equations as there are cities in the network, and hence can be solved by the 
simplex method (1), it turns out that in the cases of most practical interest, 
where the network is planar in a certain restricted sense, a much simpler and 
more efficient hand computing procedure can be described. 

In §1 we prove the minimal cut theorem, which establishes that an obvious 
upper bound for flows over an arbitrary network can always be achieved. 
The proof is non-constructive. However, by specializing the network (§2), 
we obtain as a consequence of the minimal cut theorem an effective compu- 
tational scheme. Finally, we observe in §3 the duality between the capacity 
problem and that of finding the shortest path, via a network, between two 
given points. 

1. The minimal cut theorem. A graph G is a finite, 1-dimensional 
complex, composed of vertices a, b, c, . . . , e, and arcs a(ab), 0(ac), . . . , 8(ce). 
An arc a(ab ) joins its end vertices a, b\ it passes through no other vertices of G 
and intersects other arcs only in vertices. A chain is a set of distinct arcs of 
G which can be arranged as a(ab), 0(bc), y(cd), . . . , 8(gh), where the vertices 
a, b, c, . . . , h are distinct, i.e., a chain does not intersect itself; a chain joins 
its end vertices a and h. 

We distinguish two vertices of G : a, the source , and b, the sink. 1 A chain flow 
from a to b is a couple (C; k) composed of a chain C joining a and b, and a 
non-negative number k representing the flow along C from source to sink. 

Each arc in G has associated with it a positive number called its capacity. 
We call the graph G, together with the capacities of its individual arcs, a 
network. A flow in a network is a collection of chain flows which has the pro- 
perty that the sum of the numbers of all chain flows that contain any arc is 
no greater than the capacity of that arc. If equality holds, we say the arc is 
saturated by the flow. A chain is saturated with respect to a flow if it contains 

Received September 20, 1955. 

‘The case in which there are many sources and sinks with shipment permitted from any 
source to any sink is obviously reducible to this. 
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a saturated arc. The value of a flow is the sum of the numbers of all the chain 
flows which compose it. 

It is clear that the above definition of flow is not broad enough to include 
everything that one intuitively wishes to think of as a flow, for example, 
sending trains out a dead end and back or around a circuit, but as far as 
effective transportation is concerned, the definition given suffices. 

A disconnecting set is a collection of arcs which has the property that every 
chain joining a and b meets the collection. A disconnecting set, no proper 
subset of which is disconnecting, is a cut. The value of a disconnecting set D 
(written v(D)) is the sum of the capacities of its individual members. Thus 
a disconnecting set of minimal value is automatically a cut. 

Theorem 1. (Minimal cut theorem). The maximal flow value obtainable 
in a network N is the minimum of v(D) taken over all disconnecting sets D. 

Proof. There are only finitely many chains joining a and b , say n of them. 
If we associate with each one a coordinate in «-space, then a flow can be 
represented by a point whose jth coordinate is the number attached to the 
chain flow along the/th chain. With this representation, the class of all flows 
is a closed, convex polytope in w-space, and the value of a flow is a linear 
functional on this polytope. Hence, there is a maximal flow, and the set of all 
maximal flows is convex. 

Now let S be the class of all arcs which are saturated in every maximal flow. 

Lemma 1. S is a disconnecting set. 

Suppose not. Then there exists a chain on, a 2 , . . . , a m joining a and b with 
a { $S for each i. Hence, corresponding to each a t , there is a maximal flow/ ( 
in which a t is unsaturated. But the average of these flows, 

m 

is maximal and a t is unsaturated by / for each i. Thus the value of / may be 
increased by imposing a larger chain flow on aj , a 2 , ... , a m , contradicting 
maximally. 

Notice that the orientation assigned to an arc of 5 by a positive chain flow 
of a maximal flow is the same for all such chain flows. For suppose first that 
(Ci, ki), (C 2 , kf) are two chain flows occurring in a maximal flow/, k\ > k 2 > 0, 
where 

Ci afla a 1 ) , q! 2 (®i® 2 )> • • • 1 &i(.aj— 1 , af), . . . , Q!r(^r— 1 , b) 

C 2 = 0i(a bi), fiflbibf), . . . , 0*(&*-i, b k ), . . . , 0,(6,_i, b), 
and aflaj-i, af) = p k (b k - lt b k ) € S, a HX = b k , a t = b k -i. Then 

C[ = a 1 , a 2 , . . . , atj- 1 , /3*+i, 

C 2 = 0i, 02, ... , 0*_i, aj+ 1 , . . . , a, 
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contain chains C 1", C" joining a and b, and another maximal flow can be 
obtained from / as follows. Reduce the C\ and C 2 components of/ each by k 2 , 
and increase each of the C” and C 2 " components by k 2 . This unsaturates 
the arc a jt contradicting its definition as an element of S. On the other hand, 
if (Ci, ki), (C 2 , k 2 ) were members of distinct maximal flows/i,/ 2 , consideration 
of / = ^(/i 4-/2) brings us back to the former case. Hence, the arcs of S 
have a definite orientation assigned to them by maximal flows. We refer to 
that vertex of an arc a 6 S which occurs first in a positive chain flow of a 
maximal flow as the left vertex of a. 

Now define a left arc of S as follows: an arc a of 5 is a left arc if and only 
if there is a maximal flow / and a chain a u a 2 , . . . , a* (possibly null) joining 
a and the left vertex of a with no a< saturated by/. Let L be the set of left 
arcs of 5. 

Lemma 2. L is a disconnecting set. 

Given an arbitrary chain ai(a ai), a 2 (aia 2 ), . . . , a m (a m _i b) joining a and b , 
it must intersect 5 by Lemma 1. Let a t (a t - 1, a t ) be the first a< 6 S. Then for 
each a f , i < t, there is a maximal flow /< in which a< is unsaturated. The 
average of these flows provides a maximal flow / in which a\, a 2 , . . . ,a,_i 
are unsaturated. It remains to show that this chain joins a to the left vertex 
of a t , i.e., a t _i is the left vertex of a ( . Suppose not. Then the maximal flow 
/ contains a chain flow 

[/3i(a&i), 0 2 (bi, b 2 ), . . . , fir(br— 1* b") *, k\, k ^ 0, oi(, b t —\ — a t , b t = a i. 

Let the amount of unsaturation in / of a { (i = 1 1), be k { > 0. 
Now alter / as follows: decrease the flow along the chain (3 1, 0 2t . . . , 0, by 
min [k, ki] > 0 and increase the flow along the chain contained in 

0!l* OC 2 , . . . , OC t— 1, 0i+l, • « • , 0r 

by this amount. The result is a maximal flow in which a t is unsaturated, a 
contradiction. Hence a t € L. 

Lemma 3. No positive chain flow of a maximal flow can contain more than 
one arc of L. 

Assume the contrary, that is, there is a maximal flow/i containing a chain 
flow 

[0i(abi) , 0i(bibi ) , . . . , 0 T (b r —i, £>); k], k > 0, 

with arcs 0 it 0j € L, occurring before &„ say, in the chain. Let/ 2 be that 
maximal flow for which there is an unsaturated chain 

ai(aai), <x 2 (a\, af), . . . , a,(a,_i, 6/-1) 

from a to the left vertex of 0,. Consider / = \(f x +/ 2 ). This maximal flow 
contains the chain flow [0 lt 0 2 , ... , 0 T \ k '] with k' > \k, and each a<(« = 1, 

. . . , s) is unsaturated by k x > 0 in /. Again alter/: decrease the flow along 
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/Si, /3 2 , . . . , /3 r by min [k\ ki) > 0 and increase the flow along the chain con- 
tained in ai, a 2 , . . . , a„ . . . , /3 r by the same amount, obtaining a maximal 
flow in which 0 { is unsaturated, a contradiction. 

Now to prove the theorem it suffices only to remark that the value of every 
flow is no greater than v(D) where D is any disconnecting set; and on the other 
hand we see from Lemma 3 and the definition of 5 that in adding the capacities 
of arcs of L we have counted each chain flow of a maximal flow just once. 
Since by Lemma 2 I is a disconnecting set, we have the reverse inequality. 
Thus L is a minimal cut and the value of a maximal flow is v(L). 

We shall refer to the value of a maximal flow through a network N as the 
capacity of N (cap ( N )). Then note the following corollary of the minimal cut 
theorem. 

Corollary. Let A be a collection of arcs of a network N which meets each cut 
of N in just one arc. If N' is a network obtained from N by adding k to the capacity 
of each arc of A, then cap ( N ') — cap ( N ) + k. 

It is worth pointing out that the minimal cut theorem is not true for net- 
works with several sources and corresponding sinks, where shipment is re- 
stricted to be from a source to its sink. For example, in the network (Fig. 1) 
with shipment from a t to b t and capacities as indicated, the value of a minimal 
disconnecting set (i.e., a set of arcs meeting all chains joining sources and 
corresponding sinks) is 4, but the value of a maximal flow is 3. 


b 2 ,b 3 



2. A computing procedure for source-sink planar networks. 2 We say 

that a network N is planar with respect to its source and sink, or briefly, 
N is a&-planar, provided the graph G of N, together with arc ab, is a planar 

*It was conjectured by G. Dantzig, before a proof of the minimal cut theorem was obtained, 
that the computing procedure described in this section would lead to a maximal flow for planar 
networks. 


FLOW THROUGH A NETWORK 


403 


graph (2; 3). (For convenience, we suppose there is no arc in G joining a and 
b.) The importance of o6-planar networks lies in the following theorem. 

Theorem 2. If N is ab-planar, there exists a chain joining a and b which 
meets each cut of N precisely once. 

Proof. We may assume, without loss of generality, that the arc ab is part 
of the boundary of the outside region, and that G lies in a vertical strip with a 
located on the left bounding line of the strip, b on the right. Let T be the 
chain joining a and b which is top-most in N. T has the desired property, 
as we now show. Suppose not. Then there is a cut D, at least two arcs of which 
are in T. Let these be <*i and a 2 , with ai occurring before a 2 in following T 
from a to b. Since D is a cut, there is a chain C\ joining a and b which meets 
D in ai only. Similarly there is a chain C 2 meeting D in a 2 only. Let C 2 ' be 
that part of C 2 joining a to an end point of a 2 . It follows from the definition 
of T that Ci and CY must intersect. But now, starting at a, follow C 2 ' to its 
last intersection with C i, then C i to b. We thus have a chain from a to b not 
meeting D, contradicting the fact that D is a cut. 

Symmetrically, of course, the bottom-most chain of N has the same 
property. 

Notice that this theorem is not valid for networks which are not a£>-planar. 
A simple example showing this is provided by the “gas, water, electricity” 
graph (Fig. 2), in which every chain joining a and b meets some cut in three 
arcs. 



Theorem 2 and the corollary to Theorem 1 provide an easy computational 
procedure for determining a maximal flow in a network of the kind here 
considered. Simply locate a chain having the property of Theorem 2; this can 
be done at a glance by finding the two regions separated by arc ab, and taking 
the rest of the boundary of either region (throwing out portions of the boundary 
where it has looped back and intersected itself, so as to get a chain). Impose 
as large a chain flow {T\k) as possible on this chain, thereby saturating one 
or more of its arcs. By the corollary, subtracting k from each capacity in T 
reduces the capacity of N by k. Delete the saturated arcs, and proceed as 
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before. Eventually, the graph disconnects, and a maximal flow has been 
constructed. 

3. A minimal path problem. For source-sink planar networks, there is an 
interesting duality between the problem of finding a chain of minimal capacity- 
sum joining source and sink and the network capacity problem, which lies 
in the fact that chains of N joining source and sink correspond to cuts (relative 
to two particular vertices) of the dual * 1 2 3 of N and vice versa. More precisely, 
suppose one has a network N, planar relative to two vertices a and b, and 
wishes to find a chain joining a and b such that the sum of the numbers assigned 
to the arcs of the chain is minimal. An easy way to solve this problem is as 
follows. Add the arc ab, and construct the dual of the resulting graph G. 
Let a' and b' be the vertices of the dual which lie in the regions of G separated 
by ab. Assign each number of the original network to the corresponding 
arc in the dual. Then solve the capacity problem relative to a' and b' for the 
dual network by the procedure of §2. A minimal cut thus constructed corres- 
ponds to a minimal chain in the original network. 

•The dual of a planar graph G is formed by taking a vertex inside each region of G and 
connecting vertices which lie in adjacent regions by arcs. See (2; 3). 
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ENUMERATION OF LABELLED GRAPHS 


E. N. GILBERT 

1. Introduction. The number of connected linear graphs having V 
vertices labelled 1, . . . , V and X (unlabelled) lines is found below. Similar 
formulas are found for graphs in which slings, lines "in parallel,” or both are 
allowed and for directed graphs with or without slings or parallel lines. Some 
of these graphs are also counted when the lines are labelled and the vertices 
are unlabelled. Another type of graph which is counted is connected, has no 
cycles of odd length (even graph), and has L labelled lines and X unlabelled 
vertices. 

Two graphs with labelled vertices are counted as the same if and only if 
for all i and j the same number of lines go from the vertex labelled i to the 
vertex labelled j in both graphs. Consequently two topologically equivalent 
graphs may be counted as distinct labelled graphs if they are labelled differently. 
An enumeration of some unlabelled connected graphs has been given by 
Riddell and Uhlenbeck (9) and by Harary (4). Riddell and Uhlenbeck also 
count connected graphs with labelled vertices in which slings, lines in parallel, 
and directed lines are ruled out. 

2. Graphs with labelled vertices. The kinds of labelled graphs men- 
tioned above are easy to count if one removes the condition that the graph be 
connected. For example the number of labelled graphs (including disconnected 
graphs) which have V vertices labelled 1, . . . , V and X unlabelled lines and 
no pair of vertices joined by more than one line (no lines in parallel) is clearly 

. For, every such graph is just a collec- 
tion of X of the $V(V — 1) lines which can be drawn between pairs of distinct 
vertices. More generally let P be any property of connected graphs. Let T v ,\ 
be the total number of graphs having V labelled vertices, X unlabelled lines 
and such that every connected component of the graph has the property P. 
Suppose the numbers T v ,\ are known and consider the number C v ,\ of 
connected graphs of V labelled vertices and X unlabelled lines which have the 
property P. To get a recurrence equation for CV.x note that in a graph J 
with V + 1 vertices and X lines the vertex labelled V + 1 belongs to a con- 
nected component K having some number v of other vertices and some number 
H of lines. The remaining part / — K of J has V — v vertices and \ — n lines. 

There are ^ ^ ^ ways in which v of the labels 1, . . . , V can be chosen to be 
Received December 5, 1955; in revised form February 6, 1956. 
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assigned to the graph K, then i, M ways of picking K, and T v-v,\-ii ways of 
picking J — K. Hence we conclude 

(1) ^V+i.x — 23 

v.li 


(:) 


c a+ i.,r v— t?,x— p 


In (1) we must make the convention that T’ 0 ,x=lifX = 0 and T Qi \ = 0 
otherwise. Introducing the generating functions 

Cv (y) — 23 Cv,\y x 

X 

and 


T v (y) = E Tv.,,y 

X 


(1) assumes a simple form 

(2) T r+l (y) = E ( ^ jC^WTv-.iy) 


which relates C v +i(y) (the term v — 7) to Ci(y), . . . , C v (y) and the known 
T n (y). 

For computing purposes (2) is quite convenient. However, we will also solve 

(2) for C v +i(y) explicitly in terms of T v (y). This solution is obtained in the 
form of the generating function 

(3) C(x,y) = 'ECv(y)x v /V'.. 


A compact derivation is achieved using the symbolic method (2; 6). We use 
a special pair of parentheses {...}„ to enclose expressions which are to be 
interpreted symbolically. The expressions inside such parentheses will be 
analytic functions depending on x and two letters C and T. The entire expres- 
sion, including parentheses, stands for the formula which is obtained when the 
analytic function is expanded into a power series and the term T m C n x i is 
replaced by T m (y) C n (y)x i for all m, n, i. For example (3) becomes 


C{x , y) = {exp Cx}, 

and (2) becomes 

(4) 7V +l (y) = {C{T + C) v } t . 


Multiplying both sides of (4) by x v /V\ and summing on V one derives 

{T exp Tx) s = {Cexp ( T -f C)x), 

= {Cexp Cx} s {exp Tx) s 

(5) {CexpCx},= {Texp 7*} ,/{exp 7x},. 


Integrating both sides of (5) with respect to x from 0 to x one also has 

{exp Cx) s — { C° } , = log {exp Tx} s . 

If by convention we put {C°} s = C 0 (y) = 0, then 

(6) C(x,y ) = { exp Cx } , = log {exp Tx} s . 
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Either of (5) or (6) provides an explicit solution for Cv(y ) which, in the usual 
notation, is given by the following theorem. 

Theorem 1. Let T v (y) be the generating function for the number of graphs 
with V labelled vertices , X lines , and such that each component has a given property 
P. Let C v (y) be the generating function for the number of connected graphs 
having V labelled vertices , X lines, and having property P. Then C v {y ) is (V — 1)1 
times the coefficient of x v+l in the power series for the quotient 

C v {y) is also V\ times the coefficient of x v in the power series for 

logE T v (y ) *7 VI. 
v«o 

In these series To(y) = 1 by convention. 


All the steps leading up to Theorem I can be justified using results in a 
paper on the symbolic method by Bell (2). In all our applications the power 
series {expC*}, has a zero radius of convergence; nevertheless CV(y) is 
obtained from either form of the generating series by formally expanding the 
generating series just as though it converged. One can also prove Theorem I 
using traditional methods but the proof is laborious. For example, expanding 
the generating function 

l°g(l + t.Tv(y)x T /V\) 


into a formal power series in x, Theorem I states that 


( 7 ) 


CAy) = - v\ £ 

*1 Dy 


(2b, — 1)\ ( T,(y) V‘ 

W...WV 1 ! / 


7V(y)V" 
V\ ) 


where the sum is taken over all partitions b\ + 2&2 + . . . + Vby — V. 
It is then possible to retrace the steps of our symbolic proof backward to show 
that the expression (7) is indeed the solution of the recurrence equation (2). 
It was noted above that there are 


hv(v-i) 

X 

graphs with X lines and V labelled vertices and no lines in parallel. Hence 
putting 

(8) 7VW = (1+ y) iv<v ~ l> 

in Theorem I or in (2) we count connected graphs with V labelled vertices 
and X lines, none in parallel. Setting y = 1 we count these graphs by vertices, 
allowing any number of lines. For V = 1, 2, . . . , 5 we find 1, 1, 4, 38, and 728 
connected graphs. The 38 connected graphs with four labelled vertices fall into 
the six topologically distinct types shown in Figure 1. 



I 
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J 

< 1 

1 


( i 3 

12 

12 




Figure 1 




Similarly to count graphs in which any number of lines in parallel are 
allowed T v ,\ is the number of combinations of X lines drawn from |F(F — 1) 
different kinds of lines with repetitions allowed, i.e. 

and 

(9) T v (y) = (1 - y)-' nr - 1 \ 

A sling is a line of a graph which has both its end-points the same. Two 
slings which share the same vertex will be considered to be in parallel. Slings 
were ruled out in the above enumerations. If slings are to be allowed the same 
sort of argument applies. Now there are V additional kinds of lines to choose 
from so that the terms \V(V — 1) in (8) and (9) are to be replaced by 
%V(V+ 1). 

Similarly various kinds of directed graphs may be counted. For instance 
if the graphs are to be composed entirely of lines directed between different 
vertices but slings are excluded there are V(V — 1) kinds of lines and the 
exponents of (8) and (9) are to be multiplied by 2. Note that this modification 
of (8) counts two lines joining the same pair of vertices as not in parallel if 
they have opposite direction. To count directed graphs with slings the expo- 
nents in (8) and (9) are changed to F 2 , which is the number of kinds of directed 
lines including the F slings (which can have only one direction). A diadic 
relation may be interpreted as a directed graph in which there is a line directed 
from i to j if i has the given relation to j. In a paper enumerating structures of 
relations, Davis (3) counts certain kinds of topologically different (i.e. 
unlabelled) directed graphs. 

Our results are summarized most compactly by the generating functions of 
the following theorem. 


Theorem II. The number of connected graphs having V labelled vertices and 
X unlabelled lines is VI times the coefficient of x r y x in a generating series of the 
form 

(1 + ay)"’ 10 ■ 


( 


logl 1 4- 


z 

i-1 


i\ 


a ). 


where 
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and 


j — 1 if lines in parallel are allowed, 

1 1 otherwise, 

for ( undirected ) graphs without slings , 

^ for {undirected) graphs with slings, 
i(i — 1) for directed graphs without slings, 
i for directed graphs with slings. 



The formula with a — 1 , /3(f) ~ (j% ) was derived 
(8) using quite different techniques. 


by Riddell and Uhlenbeck 


3. Graphs with labelled lines. Graphs having labelled lines and un- 
labelled vertices may be counted by exactly the same technique as used in 
§2. Equation (1) remains true if we interpret C v ,\ as the number of connected 
graphs with property P and having V lines labelled 1, 2, , V and X unlabel- 

led vertices and T v .\ as the number of (perhaps disconnected) graphs having 
V labelled lines, X unlabelled vertices and such that every component has 
property P. With this change in viewpoint Theorem I counts connected 
graphs with labelled lines if the corresponding graphs (not necessarily con- 
nected) can be counted. 

As an example of such an enumeration, let P be the property that every 
cycle of the graph must be of even length. Konig (7, Ch. XI, §4) calls such a 
graph an even graph. We now use L for the number of labelled lines. Lines in 
parallel will be permitted but slings are cycles of odd length and so are ex- 
cluded automatically. 

The vertices of even graphs separate into two classes A and B with the 
property that every line has one end-point in A and one in B. The connections 
among the L end-points in A and the L end-points in B form two groupings 
of the L lines into clusters; all lines belonging to the same cluster are connected 
together at a single vertex. It will be convenient first to count even graphs 
in which each vertex carries a label A or B to show to which of the two classes 
it belongs. Let a denote the number of vertices in A and b the number in B. 
The number of graphs with these values of a and b is just S(L, a) S(L, b) 
where S(L, k) is the number of ways of putting L different objects (the end- 
points) into k groups (the vertices) so that no group is empty. S(L, k ) is a 
Stirling number of second kind (5, p. 179) and is given by the generating 
function 

22 S(L, k)y k y j = exp [y (exp s - 1)]. 

L.k 


( 10 ) 
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The generating function T L (y) for the number of graphs with X vertices is 
then 

T L (y) = Z S(L, a) S(L, \ - a) / 

(11) = (E S(L, k ) /)’. 

Theorem I together with (11) solves the problem for even graphs with 
A , B labels on the vertices. When the A , B labels are removed one finds that 
most of the even graphs were counted twice (once for each choice of the A , B 
labelling). However, those graphs for which the A and B partitions were 
alike were counted just once. Of course there is only one connected even graph 
of this sort (the graph with two vertices and all L lines in parallel). Hence the 
generating series c(x , y) for even graphs is one half the sum of the series for 
even graphs with A , B labels plus the series 

£ 1 • y* 71 = /(exp x - l). 

L-l 

The final result is 


Theorem III. The number of even graphs with L lines labelled 1,2 , ,L 
and X unlabelled vertices is L\ times the coefficient of x v y x in the generating 
series 

2^/ (exp x - 1) + log^l + 2 T *(y) *VJh)} 
where T k (y) is given by (11). 


To count even graphs by lines allowing any number of vertices, set y = 1. 
Then 3T*(1) = G\ where G k is the number of ways of grouping k distinct 
objects into any number of clusters. A table of G k is given by Bell (1). For 
L = 1, 2, . . . , 5 there are 1, 2, 8, 60, 672 even graphs. The 60 even graphs 
with 4 labelled lines fall into 10 topologically distinct types shown in Figure 2. 

A simple enumeration may be given for connected directed graphs allowing 
lines in parallel and slings. If we drop the connectedness requirement the 
number of such graphs which have L labelled lines and X unlabelled vertices 


1 2 3 A 


12 LABELLINGS 






6 3 3 4 1 

Figure 2 


Copyrighted material 


ENUMERATION OF LABELLED GRAPHS 


411 
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is S(2L, X); for, the vertices represent clusters of end-points taken from the 
2 L distinct end-points of the L directed lines. Theorem I now applies with 

T L (y) = £ S(2L, X)/. 

A 

This enumeration can be modified to rule out slings. The number S(2L, X) 
must now be replaced by the number of groupings of 2 L distinct end-points 
into X clusters such that no cluster contains both end-points of a line. This 
number is easily found by the principle of inclusion and exclusion (8) 

T L .X = £ (-!)*( \) 5(2 L - k, X). 

Undirected connected graphs with lines in parallel allowed but with no slings 
may now be counted by noting that, with one exception, each such graph 
contributes 2 L directed graphs to the preceding enumeration when the lines 
are directed in all possible ways. The exception is the graph with two vertices 
(L lines in parallel), which contributes only 2 L ~ l directed graphs because a 
simultaneous reversal of all L directions on this graph does not produce a new 
directed graph. 
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CERTAIN INFINITE ZERO-SUM TWO-PERSON GAMES 


A. L. DULMAGE and J. E. L. PECK 


1. Introduction. The theorem of von Neumann, that every finite, zero-sum 
two-person game has a value, has been extended in various ways to infinite 
games. In particular Wald (6) has shown that any bounded game in which 
one player has finitely many pure strategies, has a value. Our interest was 
aroused by the infinite analogue of the game of “hide and seek” as described 
by von Neumann (5), which does not appear to fit any of the known cases, 
unless the matrix is bounded. However, the bounded game is dull since its 
value is zero. This has led us to give another set of sufficient conditions under 
which an unbounded infinite game may have a value. 


2. Notation and definitions. The game (7, J , K) will consist of two 
arbitrary sets 7 and J and a real function K on the product set 7 X J. I f the 
maximising player chooses i € 7 and the minimising player independently 
chooses j € /, then the former receives the amount K (i, j) from the latter. 

Let £ = {*<; i € 7} denote a vector with dimension the cardinality of 7, 
such that £ Xi — 1 and all x t > 0, the sum being taken in the sense of Bour- 
baki (3, Ch. Ill, §4). The vector £ will be used as a mixed strategy for the 
maximising player. Similarly rj = {y^} will be used as a mixed strategy for the 
minimising player. We write 

K&r,) = T,K(i,j)x, yi 
i.j 

when the expression on the right is summable in the sense of Bourbaki (3). 
For a particular £, 

inf X(£, v) 

will denote the infimum over all those 77 for which K(£, rj) exists. We write 


Vjj = sup inf i£(£, 77) 

£ v 

and similarly 

Vij = inf sup 7 C(£, 77). 

V f 


If the sets 7 and J are finite then the fundamental theorem states that 
Vjj — Vjj. If J is a finite set and the K(i,j ) are bounded then it is known 
(1) that Vu = Vjj. In the general case, if i£(£, 77) exists for all £ and 77 then 


( 1 ) 


ViJ < Vjj. 


Received, October 20, 1955. This work was done while the authors were fellows of the Sum- 
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However, as we may see from an example of Bohnenblust, Karlin and Shapley 
( 2 ), if there is some K(£, rj) which does not exist, we may have v TJ > vjj. 

Games for which ( 1 ) holds we call admissible, and we say that any game for 
which Vu = v TJ is determinate, or has a value. Any game for which the K(i,j) 
are bounded below or bounded above is admissible, because all the K(£, 17 ) 
exist. It is a simple matter to construct admissible games and in fact deter- 
minate games in which, for some £ and tj, K(£, rj) does not exist. For any 
admissible game we have from ( 1 ) and the definitions that 

(2) Vij K VlJ K Vln, 

for any subset n of J. 

If M is the collection of finite subsets of I directed by inclusion, then 
lim</(i) = A will mean that for all e > 0 there exists an m £ M such that for 
all i $m, | f{i) — A\ < e. Similarly lim mtM f(m) — A will mean that for all 
c > 0 there exists an m' 6 M such that for all m 3 m’ , \f(m) — A\ < e. 
There are obvious modifications in the case where A is not finite. Similarly N 
will represent the collection of finite subsets of J. We denote the cardinal 
number of the set $ by |s|. 

3. A sufficient condition for an admissible game to have a value. 

Theorem. If the game (/, J, K) is admissible and if for each j € J there is a 
real number L } such that 

(3) inf K(i,j) = Lj 

(4) lim K(i,j) = L Jt 

< 

then the game has a value 

v — lim v In , v < + oo, 

ntN 

and the maximising player has an optimal strategy. 

Proof. For any n € N, the game (/, n, K) is bounded by hypothesis, and 
so (1) has a value v jH = tf/„ = v lH . But if n C n', then v In > v In > so that we 
may write 

(5) v = lim v In 

ntN 

and v < + <» . From (2) and (5) we have that 

(6) i) u K Vij ^ v. 

If v = — oo , then the game has the value — °o and every strategy for the 
maximising player is optimal. Thus we need only consider the case where v 
is finite. 

For each n 6 N we may choose a strategy £„ = {x nl } for the maximising 
player which is |»| -1 optimal for the game (/, n, K) in the sense that for all 
j € rt 
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(7) K(jL,j) = E *(». J>»i > - \n\~\ 

i 

Since the closed interval [0, 1] is compact, it follows from the Tychonoff 
theorem that the product ir of [7| of these intervals is compact (4) in the 
topology of coordinate-wise convergence. Since for every n £ N, £ n = {x„<} 
lies in t, the net {£„;» € -N} has a convergent (4) subnet {£„; n £ N'\. 
For each i put 

Xt = lim x nl . 

ntN’ 

Then for every i £ / 

(8) 0 < */ < 1, E < 1. 

it I 

We now write £' = {x/} even though £' may not be a strategy. 

Put L («,./) = K(i,j ) — Lj, then from (3) 

(9) L(i,j) > 0, 
and from (4), 

(10) lim L(i,j) = 0. 

i 

With the obvious interpretation of L(£ n ,j) and Z, (£',./) we shall prove that, 
for all j £ J, 

(11) Hm L(W) = £(r,j). 

ntN' 

In fact, given e > 0, choose, by (10), a finite subset m } C I so that when 
i #m /t L(i,j) < $e, and putting 

= ma xL(i,j), 

t 

choose «' € N* so that whenever n follows n' in N' we have for all i £ m } 
that |xw< — x% \ < e(3|mj|2? i )~‘ 1 . Then 

\U(.j) - m'j ) i = i e Ui,j)x. t - e 

i i 

< E - *i'| + E £(*,j)*.<+ E L(i,j)*; 

i€.m } i^nij ifyntj 

< B, E «(8|m,|B,)-* + l<E + s«E 

<«my < < 

^ 1 , 1 , 1 

< sc -+• sc + sc = e. 

Since for each j £ n, we have from (7) that 

vm ~ |»| -1 < L(£«J) + L } 
it follows from (5) and (11) that for all j £ J 

(12) v < L(S',j) + L } . 

Let £ Xt = 0, then from (8) 0 < 0 < 1 and if £* = {x,*} is any mixed 
strategy, put £ — £' + (1 — 0)£*, that is for all i, x< = x% + (1 — 6)x*. 
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From (8) we have that £ jc, = 1 and that all x { > 0 so that £ is a mixed 
strategy. Using (12), (9) and the fact that x t ‘ < x it we have for all j £ J 
that 

■> < E L(i,j)x t ’ + L, < D L(», i)* ( + i,E *<= E *(»,.;)*, = 

i i it 

and therefore that 

i; < infJC(£,j). 

i 

However, from the definition of v IJt 

inf K(Z,j) < vu; 

s 

so, using (6), 

v < inf K(£, j) < Vjj < Vu < t>. 

i 

This proves that the game has a value v , which by (5) is 

lim v 

ntN 

and that £ is an optimal strategy for the maximising player. 


4. The infinite game of hide and seek. This game is played on a count- 
ably infinite matrix (a fi ) where a {) > 0. The hider chooses a place ( i,j ) and 
the seeker chooses either a row i or a column j and if he “finds” the hider, 
the amount a t} passes from the hider to the seeker. In the finite n X n case, 
von Neumann has shown (5) that the value of the game is S n ~ l , where, if P n 
is some permutation of the integers i = 1 », 

n 

S n = max^ (a„p.) -1 * 

Pn <-l 

We observe, in both the finite and the infinite case, that corresponding to 
every pure strategy of the hider, the seeker has only two pure strategies in 
which the pay-off is positive and in all other cases it is zero. If this game is 
considered in the normal form (/, /, K ), this means that, for each j, every 
K ( i , j) is zero except for two which are positive. The game is clearly admissible, 
since the K{i,j) are bounded below by zero. The conditions of our theorem are 
easily satisfied with Lj = 0 for all j and the infinite game therefore has the 
value 

v — lim v„, 

ti-ko 

where v n is the value of the game ( oo , n 2 , K). However the game (» , n 2 , K ) 
is clearly equivalent to the game (2 n, n 2 , K), whose value is 5,~ l , because if 
the hider is restricted to a square, the seeker would not seek outside it. 
Thus 

v = lim v n = lim S n ~ . 

n-» oo n-KD 
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If we let P be any permutation of the set of positive integers onto itself and 
if 

oo 

•S’ = sup 52 

p i-i 

then it is easily shown that 

lim S n = S, 

n- k*> 

and therefore that v = S~ x . In fact, if 5 < then we can choose P so that 

53 > S — K 

<-i 

and then choose n so that 

53 > S — e. 

<- i 

If 

m = max i p , 

K« n 

then we have 

n 

S > Sm> 53 > S - 6. 

1-1 

The case S = <» is similar. This justifies our statement in §1 that the value 
of the bounded game is zero, since in that case 5 = °° . 

There exist unbounded hide and seek games in which the value is not zero, 
for example if a i} = 2 mai(<,/) , the value of the game is 1. 

There is the same connection between the game of hide and seek and the 
optimal assignment problem, as in the finite case, but with obvious modifica- 
tions. 

More general games of hide and seek can be considered as played on a 

/-dimensional array ( a tl <,) where the hider chooses a place and the 

seeker chooses some r subscripts, r < /. Our theorem shows that such infinite 
games have a value. 
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ON THE BASIS PROBLEM FOR VECTOR VALUED 

FUNCTION SPACES 


H. W. ELLIS 


1. Introduction. In a recent paper (2) Halperin and the author con- 
sidered separable Banach spaces L x of real valued functions on general measure 
spaces and proved the existence of 1 -regular (§2) Haar or a - Haar bases when X 
was the classical £-norm or any levelling length function (3) and, more generally, 
of E-regular Haar or tr-Haar bases when X was a continuous length function 
satisfying certain additional conditions (2, Theorem 3.2). 

In the present note, separable spaces L X (S ; X), T*(S; X ) of functions valued 
in a normed vector space X on a general measure space S are considered and 
the existence of a 3EE'-regular basis is established when L X (V X ) has a ^-regu- 
lar Haar or a-Haar basis and X has a E'-regular basis. 


2. Terminology. 5 will denote an arbitrary space of points P with a 
countably additive, non-negative measure y(E) defined for a complemented, 
countably additive family of sets; X will be an arbitrary length function; L X (S) 
will denote the Banach space of real valued functions f(P) on 5 with X(/) 
defined and finite; X will denote an arbitrary normed vector space with real 
scalars; || || the norm in X; L X (S] X) the space of Bochner measurable 
functions (4) f(P) valued in X on S with X(/) = X[/(P)] = X(||/(P)||) de- 
fined and finite. (If X is complete L x (-5; X) is a Banach space (3).) 

Upper case letters will be used for arbitrary measurable sets, lower case 
letters will always denote measurable sets of finite measure ; f B (P) will denote 
the function equal to f(P) in E and vanishing elsewhere and X(E) will be an 
abbreviation for X(1 B ). 


Definition. A basis {*<} in X will be called K-regular , if 


2.1 



<K\\x\\, 1 <E < <», 


n = 1, 2, . . 


for every x = Y, a iXi € X. 

The referee has pointed out that, if X is a Banach space, Banach’s bounded- 
ness theorem (1, p. 80) shows that any basis in M is a E-regular basis for some 
K. 


3. Bases in L X (S;X). Suppose that X has a E-regular basis {x t }. If 
f(P) is valued in X, 

3.1 f(P) = E«,(P) x tl 


Received May 26, 1955. 


417 


418 


H. W. ELLIS 


the di(P) being determined uniquely for each i and all P. We shall show 
that if/(P) € L X (S;X) then the real valued functions a,(P) (i = 1, 2, . . .) 
belong to L x (5). This involves showing that each cii(P) is measurable with 
X[a 1 (P)] < oo . We note that the uniqueness of the representation x = x^X 
implies that if/(P) is constant in E so is each a 4 (P). Since 


3.2 |a,(P)| | M| < 



dj{P)Xj 


+ 


<-i 


E 


dj(P)Xj 


<2K\\f(P)\\, 


and f{P) is the almost uniform limit of countably valued functions, each a<(P) 
is the almost uniform limit of measurable (countably valued) functions and is 
therefore measurable. Thus X(a<) is defined for each i. Using (3.2) and pro- 
perties (L 2) and (L 4) of length functions ( 3 ), 


xmp)] - x(MP)|) < 2XX(|1/(P)||)/|M| < •. 


Lemma 3.1. If f(P) € L X (S; X), where X is d continuous length function 
dnd L X (S) is sepdrdble , if X hds o K' -regular busis {x t } , and if (3.1) holds , 
then 

3.3 !im A /(P) - £ a,(P)x, - 0. 

n-yco L 1 J 


Proof. First suppose that y(S) < oo. Given e > 0, let e denote the set of 
points P for which 


f(P) - £ a t (P)x t 


< € 


for all n > N. Then y(S — e) — » 0 as N — > » , 

^[/(i 5 ) - E a,(P)x , ] < , 7 ( 5 ) + (1 + K’) 

and \{f s-e) — > 0 as N— * » by ( 2 , Lemma 3.2). 

If 5 is arbitrary there exists e' by ( 2 , Lemma 3.2 (iii) ) with \(J — f e >) 
arbitrarily small, 

*\f(P) - E a,(P)x , ] < (1 + K') \(J + {(f - t, a,(P)*,)..] 

and the right side can be made arbitrarily small by choice of e' and n. 


Lemma 3.2. Letf(P) 6 L X (S; X), let L X (S) have a Haar or a-Haar basis and 
X have a K'-regular basis {M and suppose that (3.1) holds. Then (3.3) holds. 

Proof. The assumption that L X (S) has a a-Haar basis implies that L X (S) 
is separable and that S — E + U e n where \(J B ) = 0 for every f(P) £ L x (5) 
and where the a-Haar basis functions correspond to a a-Haar system of sets 
H a (\Je n ) which forms a countable basis ( 2 ) in 5. Then for arbitrary 
f(P) € L X (S), where 
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UP) 


f(P) in he n , 

i 

0 elsewhere, 


\(J) — A(/ s-b) — sup A(/v), 

N^co 


by (L 5) for length functions so that A is continuous and Lemma 3.1 applies. 
A similar argument applies if L X (S) has a Haar basis. 

Corollary. Under the hypotheses of Lemma 3.2, A[a n (P)] — > 0 as n — > <» . 


If {^(P)} is a basis in L X (S), afP) = X/j<^(P), (* = 1,2,...), the 
coefficients a i} being uniquely determined. Thus with each /(P) 6 L X (S ; X) 
can be associated a unique double series 

3.4 /(P)-E«[E/WP)]*<. 

We shall show that the {Xi<j>j(P)}, ordered suitably into a single sequence, 
form a basis in L X (S; X). 

In the proof of the next lemma the Bochner integral would be used if X 
were a Banach space. To extend the proof to an arbitrary normed vector space 
X, we generalize the Bochner integral for certain functions by defining, where 
x t 6 X , gi(P) € L(S) (i = 1,2,..., n), 

PO = Z X, f s g,(P)<tr(P). 

We shall use the fact that 


3.5 




dy(P). 


This is easily shown if the g t are finitely valued, constant in the same sets, 
and the general result is then obtained by standard arguments. 

Lemma 3.3. If {<t>i(P)\ is a K -regular Haar or a-Haar basis in L X (S), if 
{*<} is a K' -regular basis in X, then for all m, n, 


3.6 


i EE a<iX,<h(P) < KK'\{f). 

L i-i J 

Proof. By (2, Lemma 2.1, Corollary 1) 

Z) a ij<t>j(P) = 2 ([tC^) - 1 a<(P) dy(P) J in e^j 

for some sequence of Haar or cr-Haar sets e r depending only on n where 


JV 


U e T — 6’ 


(E defined as in Lemma 3.2) if L X (S) has a Haar basis and where 


Copyrighted material 


420 


H. W. ELLIS 


23 a i) < i > } (P) 

1 

vanishes outside \Je, if L X (S) has a a-Haar basis. Let ||/(P)1| = £ 
Then 



a t jXi<j>j{P) 


}- 4 5 -4, ([*■>“ 

1 J* a,(P) dy(P'?\ in «,)} 

- { s(I>>"<« i 

in 

<4§ ([,«,)-' jj 

| E) “i(-P)*i| in 

i-i; 

||/(P)||<iT(P)]ine r )} 

= K'\{ E b,UP)} 


<KK'X\f(P)). 



The sequence 


E 


E 


atjXitjiP) 


is .OC'-regular in Z, X (S; X ) and suggests that the Xi<t>j(P) be ordered so as to 
give partial sums differing as little as possible from square or rectangular 
sums. To this end we order then as follows: Xi<f>i, x 1 ^ 2 , * 2 $i. xz<t >%, ...» 

Xn^l} X n <f) 2, • ♦ • 1 #n0n— 1» • • • » X n (f) nf .... Given _/(P) £ L X (5, X) f 

consider 3.4 and let Sy = S N (f) denote the sum of the first N terms atfic^^P) 
with the above ordering. Let 

M N 

S\f,N — 23 23 atjXi<f>j(P ). 

<-1 >. 1 

Then 


n— 1 n— 1 AT-(»-l)2 

3.7 Sy = 22 23 a ij x i<f>j(P) + 23 OnjXn<t>j(P), (n ~ l) 2 < TV < «(» — 1) 

i-1 >-l i-l 

n n— 1 N— n («— X) 

= E E a t jXi<f)j(P) + 21 ai n Xi<t> n (P), n(n - 1) < TV < » 2 . 

i-i j-i i-i 

3.8 Sy = + ‘Sn.AT-tn- 1) J ~ (« — ) 2 < N < «(« — 1); 

“ *Sn,»— 1 T" *SjV— »(«—!),« *5 y— «( n— 1) ,«— 1> w (w 1) <C TV ^ n 2 . 

From 3.7, 3.8 and Lemma 3.3 we obtain for every / £ L X (S;X) and all N 

3.9 X[Sy(/)] < 3 KK'\(f). 
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Theorem 3.1. If L X (S) has a K-regular Haar or <x-Haar basis (<^(P) } and if 
X has a K' -regular basis {*<} then the Xi<t>j{P) with the ordering of the preceding 
paragraph form a SKK' -regular basis in L x (5; X). 

Proof. We shall give the proof where L X (S) has a a - Haar basis, the proof 
where there is a Haar basis being similar and simpler. We can suppose that the 
a - Haar basis functions correspond to a <r-Haar system of sets H a (S) dense in 5 
(cf. Lemma 3.2). As in Lemma 3.2 each / £ L X (S]X) is the strong limit of 
functions f e (vanishing outside sets of finite measure). Each f e is the almost 
uniform limit of finitely valued functions and an easy computation using 
(2, Lemma 3.2) shows that these functions converge strongly to f e in L X (S\ X). 
Finally each set of constancy of an arbitrary finitely valued function g can be 
approximated arbitrarily closely by finite collections of sets of II V (S) with the 
corresponding functions converging strongly to g. We conclude that finitely 
valued functions with sets of constancy in H a (S ) are dense in L X (S;X). 
Since 

M/CP) - s s (f)] < x[/(P) - HP)] + M HP) - SAh)] + x [£*(/) - s„m 

< (1 + 3 KK’) \(J-h)+ X[* - SAH], 


for any/, h in L X (S; X) it vviH be sufficient to prove that X[/ — 5y(/)] — * 0 as 
n — > oo where/(.P) is a finitely valued function with sets of constancy in H a {S). 
Then, if/(P) = X^iCP)**. each a<(P) is finitely valued with the same sets 
of constancy as /(P) and by ( 2 , Lemma 2 . 1 , Corollary 3) there exists n 0 
with 

n 

2 a i J < i ) j(P) = a<(P), n > no, i = 1,2, 

j - 1 


Then 


X[/- S s (f)] < x[/- X>>CP)*,]+ x(e*,[< 2,(.P) - Ea„<f>,(P) ]) 


[ AT— (n— 1)2 

Xn 2 a nj <t>j(P) J 


(n — l ) 2 < N < n(n — 1 ), 


= Xi + X2 + Xa 


[ n ”1 /i\T— n(n— 1) T n ”1\ 

/ - a t (P)x t j + X\^ 2 x^a t {P) - 2 ««0i(-P)Jj 

+ x( Oi(P) ~ ^ij<t>j(P) ) »(» — 1) < N < n 2 , 

\<-V-n(n-l) + l L j = 1 J/ 


— Xi + X 2 4- X 3 ; 


Xi — > 0, X/ — ► 0 as n — > a> by Lemma 3.2, X 2 , X 2 ' and X 3 ' vanish if n — 1 > n 0 
and X 3 < K\ |x n | | X[a„(P)] — > 0 as n — > °° by Lemma 3.2, Corollary. 

Write X/ for sums of terms OijX^jiP) ordered as in Theorem 3.1. We have 
established the existence of a convergent series Yi' a i)Xi<f>j(P) with sum /(P) 
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for every f(P) £ L X (S] X ). The Xi<f>j(P) with the specified ordering will be a 
basis in L X (S; X) if there is only one such series for each f(P) and that this is 
true is a consequence of the uniqueness of the respective series for elements of 
X and L X (S) in terms of the bases {;*:<}, {< f>j(P )}. 

The referee has observed that a .^-regular basis in X is a basis in the com- 
pletion of X, so that there would be no loss of generality in assuming through- 
out that X is a Banach space. This would permit the use of the Bochner 
integral in Lemma 3.3. 

4. Bases in I^GS; X). In §3 the assumption that X is a length function 
implies that L X (S) is a Banach space. The above arguments remain valid for 
more general function spaces. Consider a general normed vector space of 
measurable functions f(P) with norm X(/) = X(|/(P)|). The definition of a 
norm implies properties (LI), (L 3) and (L 4) of length functions for X. 
Property (L 2) has played a fundamental role in the proofs in §3. However 
if the normed vecor space is not required to be complete the results in §3 can 
be obtained with (L 5) replaced by weaker assumptions. 

Suppose that for every measurable function u with 0 < u(P) < <» for al- 
most all P, X(«) is defined with 0 < X(«) < <» and satisfies (L 1)-(L4) for 
length functions, 

(L 5') If e is fixed, e' C e, X(/) < » , then \(J e ) — X(/ e <) — ► 0 as 7(0 — e')— *0, 
and 

(L 6) X(w) = sup e X(w e ) (i.e. X is continuous). 

Let 1^(5) denote the space of real valued functions f(P) with |/(P)| meas- 
urable and \{f) = X(j/|) < <» and let X) be the analogue of L x (5; X). 

I^GSlX) is a normed vector space. If F X (S) is separable and X(/) < «>, the 
argument of (2, Lemma 3.2) gives: 

(i) X(/ e ) — > 0 as 7(c) — * 0, and 

(ii) there exists e with X(J — f e ) arbitrarily small. With L x replaced by V x 
Lemmas 3. 1-3.3 and Theorem 3.1 then hold. 
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ON EXPLICIT BOUNDS IN LANDAU’S THEOREM 


J. A. JENKINS 


1. The theorem of Landau in question may be stated in the form that if the 
function F(Z) is regular for \Z\ < 1 and does not take the values 0 and 1, 
while 

F(Z) = <io -h &iZ -f- • • • 

is its Taylor expansion about Z — 0, then K| has a bound depending only on 
a 0 . In fact K| has a bound depending only on Ml and Hayman (1) gave the 
explicit bound 

Ml < 2 Mlllog |<Zo|| + 5x}. 

In a recent paper (2) I gave a simple method for obtaining explicit bounds in 
Schottky’s Theorem and applied it also to improving the above bound 
to 

Ml <2|a 0 |{|log KIM- 7.77}. 

Since writing that paper I have observed that by relatively small modifications 
of the argument that bound can still be substantially improved. 


2. It is well known that, for a given ao, the maximum value of K | ls attained 
for the function F 0 (Z) mapping |Z| < 1 onto the universal covering surface 
of the finite IT-plane punctured at 0 and 1 and taking the value a 0 at Z = 0. 
Now |Z| < 1 is mapped conformally onto 9?z > 0 in such a way that if the 
mapping function is Z — Z(z) and we set F 0 (Z(z)) = /(z), then for a suitable 
branch of log/(z) the mapping 

W = log/(z) + x i 

(where — or + is chosen according as $a 0 > 0 or $a 0 < 0) carries the domain 
determined by the inequalities 

— ir < 3z < 7r, 9?z > 0, |z — ^7ri| > |x, |z -f hvi\ > b r 


onto the strip 


— x < < x 


so that the boundary points ±xi correspond to themselves. Further, the 
boundary points of these domains at infinity in whose neighborhoods 9?z, 
become large and positive correspond and the boundary point z = 0 corres- 
ponds to the point at infinity in whose neighborhood ffiw becomes large and 
negative. We denote the point in the z-plane corresponding to Z = 0 by b. 
Moreover we set f = e ~ *, = e~ v and denote the corresponding mapping 
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between these planes by f = 0 (cj) or co = ^(f). The function <t>(u) is regular 
and univalent for |w| < 1 with <£'(0) = 1/16. 

Next we observe that, as was proved in (2, p. 80), in obtaining a bound of 
the form 

hi < 2 fa 0 f { |Iog |a 0 || + K), 

it is enough to confine ourselves to the situation |o 0 | > 1, |a 0 — 1| > 1. Then 
we use distinct arguments according as |a 0 | is near 1 or bounded from l.For 
|a 0 | near 1 we use the fact that under the mapping from the z-plane to the 
w-plane the half-plane 9?z > is mapped into the 70 -plane slit along the 
half-infinite segments $w = (2 n + 1)tt, 9 ?t 0 < 0, n running through all 
integers. Comparing the inner radii of these domains with respect to b and its 
image with the derivative of the mapping function, namely ai/2a 0 ?Hb, we get 
the bound (2, p. 81) 

|ai| < 2(|a 0 | |a 0 — l|)*log|2a 0 — 1 + 2{a 0 h — 1)}*| 9?6/(9?5 — %t). 
Since the conditions |a 0 | >1, |a 0 — 1| > 1 imply we have for 

hi = t, t > 1 

|ai| < (3 + 3*) h|(l H" C 1 )* log[2/ + 1 + 2(/ 2 /)*] 

< 2hl(loghl 4* A (/)} 

where 

A(/) = §(3 + 3*)(1 + C 1 )* log[2£ -f- 1 -+- 2 (^ 2 -+- J)*] — log t. 


Unlike the function L(t) used previously the function A(f) is not monotone 
increasing. However direct calculation shows that on the range / > 1 it first 
decreases to a minimum and from then on increases. Thus, on an interval 
1 < t < to, A (t) does not exceed the larger of A(l) and A(/ 0 ). It proves 
advantageous to take the interval 1 < t < 1.84. We readily find 

A(l) < 5.90, A (1.84) < 5.94. 

Thus for 1 < / < 1.84, A(/) < 5.94 and for 1 < |a 0 | < 1.84 we have 

hi < 2|a 0 | {loghl + 5.94}. 


Now we apply to the function <£(«) instead of the bound previously used 
(2, p. 81) the result due to Robinson (3, p. 444) 

16|!f 2 


dt_ 

dw 


1 2 1 — |<^| 


w 


Using the fact that 


we get 


</>'( — do 1 ) = — 


2a» 

a\e b 


|oi| < , I?" 1 * ■ 

flo — 1 
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Moreover (2, p. 79) 


e* b < 16ja 0 | + 8, 


so 

M < (2|a 0 | + 1) log(16|a 0 | + 8) j— ^ — r . 

|ao| “l 

Then for |a 0 | = t, t > 1, we have 

|fli| < 2|a 0 | {log|a 0 | + M(t)}, 

where 

M(t) = (/ + \)t (/ 2 - l)” 1 log (16/ + 8) - log /. 

Direct calculation shows that M (/) is decreasing for / > 1. Now ilf (1.84) <5.93. 
Thus for |a 0 | > 1.84 we have 

M < 2|a 0 |{log|a 0 | + 5.93}. 

Combining this with our previous estimate we have 

Theorem 1. If F(Z ) is regular for \Z\ < 1, does not take the values 0 and 1 
and has Taylor expansion about Z = 0 


F{Z) = Oo 4" #i Z. -f“ . . . , 


then 

M < 2|a 0 |{|log|a 0 || + 5.94}. 

As Hayman has remarked in his review of (2) (Mathematical Reviews, 
16 (1955), 579) the value 5.94 cannot be replaced by 4.37. 
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THE CAUCHY PROBLEM FOR LINEAR PARTIAL 
DIFFERENTIAL EQUATIONS WITH 
RESTRICTED BOUNDARY CONDITIONS 


E. P. MILES, Jr. and- ERNEST WILLIAMS 


We shall discuss solutions of linear partial differential equations of the 
form 


( 1 ) 


$(£>, Xi, x 2 , . . . x n ) u -f- 4>( D , t) u — 0, 


where SP is an ordinary differential operator of order 5 with respect to t. Our 
first theorem gives a solution of (1) for the Cauchy data; 


(2) 


u(xi, x 2 , . . . x„, 0) = P{x 1 , x 2 , . . . X n ), 


dfu 

dt 


7 (* 1 . * 2 , . . . 0 ) = 0 , 


j 1> 2, . . . , 5 1, 


whenever the function P is annihilated by a finite iteration of the operator 
This situation occurs if P is a polynomial and $ any differential operator 
with constant coefficients and no constant term or if P is polyharmonic and <f» 
the Laplacian operator. The solution hinges upon the integration of a finite 
system of ordinary differential equations. 


Theorem 1. Suppose for some integer k we have 
(3) $*(P) ^ 0, & +1 (P) = 0; 


urther suppose that u 0 , u 2 , . . . w* are a set of solutions of the system of ordinary 
differential equations 

(4) + Uj—i = 0, j = 1, 2, . . . , k, 

*(«<,) = 0, 


with initial conditions 


( 5 ) 


tt 0 (0) = 1, Uj(0) = 0, 


cTuj 

dt m 


( 0 ) = 0 , 


j > h 

m = 1, 2, . . . , s — 1, all j\ 


then a solution of ( 1) satisfying (2) is 

k 

(6) u (*i, x 2 , . . . t) = £ & ( P ) • «;• 

J-d 
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Proof. 

k 

[$> + ¥]« = [4* + ¥] X) $\P)-Uj 

>- 0 
k 

= £[$ + *] 

J- 0 

= P-[* («„)] + t, <t.'(P)[*( M ,) + + 4 >* + '(P)-K« 

J-l 

= 0 , 

by (3), (4) and the linearity of the operators $ and 4 r . The conditions (5) on 
the u j ensure that (6) satisfies conditions (2) and the proof of the theorem is 
complete. 


As an application of this theorem we construct a solution of 
(7) 


du .du du n . n , * 

a Tx + i Jy + c Tt =f) ' “<*• y. °) - * 


We note that <£(P) = 2 axy -f- bx 2 , $ 2 (P) = 4<z&x + 2d 2 y, <f> 3 (P) = 6a 2 6 and 
$*(P) = 0. 




and 


(8) «(*, y, /) = x 2 y + (2 axy + bx 2 ) 


(¥) 


+ (4 abx + 2 a 2 y) 


G?)+ 

^(i?) 


(( cx — at) 2 (cy — bt) 
c 3 


The last form of (8) may also be obtained from the general solution of (7), 
F(cx — at, cy — bt), by requiring that it reduce to x 2 y when / = 0. 

An alternate set of Cauchy data frequently encountered for (1) when ^ is 
a second order operator is 

(9) u{X\, Xj, « • • X^, 0) = 0, Uf(Xi, X%, • • • X n , 0) Q(^li %2i • • • %n)’ 

For this case an analogous theorem holds: 

Theorem 2. Suppose (3) holds and V 0 , V\, . . . V k are the solutions of 

(10) *(Fo)=0, y(Vj) + Vj-i = 0, j — 1,2, ... k, 

with initial conditions 

(11) F i (0)=0, 

F'„(0) = 1, V'j(0) = 0, j — 1,2, ... k; 

then the Cauchy problem for (1) with boundary values (9) has a solution 


( 12 ) 


« = £ *\Q) V,. 

t- 0 
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The authors have recently obtained basic sets of homogeneous polynomial 
solutions (1) for the Laplace and wave equations in k variables. Although 
Theorems 1 and 2 were not discovered until after these basic sets were 
developed, they provide a natural way for deriving them. Let us consider the 
wave equation in three space variables and one time variable, 

(13) V 2 u-^ = 0. 

A general homogeneous polynomial W, of degree it in (x, y, z, t), has 

arbitrary coefficients. Requiring W to be a solution of (13) gives 
independent 1 conditions on these coefficients showing that 

of them are independent. We may construct our basic set of (« + l) 2 homo- 
geneous polynomial solutions of degree n for (13) by first applying Theorem 1 

monomials P(x,y,z) of type x a y b z e , a -f- b + c = n 

and then applying Theorem 2 to each of the + y monomials Q{x, y , z) 

of type x?y p 2P, a + p-\-y = n— 1. The resulting homogeneous polynomial 
solutions of (13) are 

(” 1 2 M” t ^ 


to each of the 


(”t 2 ) 



in number. The polynomials generated by Theorem 1 contain only one term 
of degree less than 2 in t, the generator term x a y b z c \ likewise those generated 
by Theorem 2 contain only one term xTyty t of lowest degree in /. Thus our 
set of solutions is independent and, since it is correctly numbered, is a basic 
set of solutions. 

The basic set constructed in the above manner may be represented as 
follows: for each set of non-negative integers a, b, c, d, a + b + c 4* d = n, 
d < 1 

U«1 i^J+d 

(14) KV »...<(*, y, 2 , t) = g V 2 VyV) • ‘ 


= £ 


a\b\c\\\D]x A y a z c f 


(v9 ! (^y 5 ) ! (v 2 ) 


‘Independence of these conditions may be readily established by a generalization of Whit- 
taker’s footnote on the corresponding harmonic polynomials (Whittaker & Watson, Modern 
Analysis, 4th ed., p. 389) or, as suggested by the referee, by using an argument like that in 
Courant-Hilbert, vol. I, English ed., at the bottom of p. 512. 
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where the final summation extends over all non-negative A, B, C and D such 
that A-\-B + C + D = n, A =a, B = b, C = c, D = d mod 2, and A < a, 
B < b, C < c. The final form of (14) corresponds, except for a constant 
factor, to the basic set for the wave equation developed in (1 ), but the inter- 
mediate form given here for the first time is much simpler to describe and 
use. 

Instead of trying to solve Cauchy’s problem for the wave equation in terms 
of the basic set of polynomials (14) when our initial conditions are suitable 
we may use: 

Theorem 3. The solution of (13) with initial conditions 

u{x, y, z, 0) = P{x, y, z ), u t (x, y, z, 0,) = Q(x, y , z), 

where P(x, y, z) and Q(x,y,z ) are polyharmonic functions of order p and q 
respectively, is given by the sum: 

P-l f ZJ ?-l ,2fc+l 

(15) U(x. y , «, 1) = g V”(P) W +Z v "(0) . 

This theorem is established by applying Theorem 1 to (13) with boundary 
conditions, u(x, y, z, 0) = P, u t {x, y,z, 0) = 0 and Theorem 2 to (13) with 
boundary conditions, u{x,y,z, 0) =0, u,(x, y, z, 0) = Q, and adding the 
resulting solutions. 

As an illustration of Theorem 3 we display a solution of the wave equation 
(13), with initial conditions 

(16) u(x, y, z, 0) = xe* cos y -z*, u t (x, y, z, 0) = 0, 

(17) u{x, y, z, t) = e? cos y[xz 4 + (z 4 + 6xz 2 ) t 2 + (2z 2 + x)t 4 + l/ 6 ]. 

In another recent paper (2) the authors gave basic sets of polynomial 
solutions for the Euler-Poisson-Darboux equation 

(18) V 2 m — [ u tt + kt~ l u t ] = 0 

and for the closely associated Beltrami equation. For the E.P.D. equation 
with k > 0 a direct application of Theorem 1 to the generator monomials 
x a y b z e , a -f b + c = n, gives a more usable form of the basic sets similar to 
that given for the wave equation in (14) with d = 0 and t xi / (2j ) ! replaced by 
l*/( 1 + £)(3 4- k) . . . (2j - 1 + k ) '2* j\. 

However, if k is negative, the system of differential equations (4) associated 
with the solution of (18) under conditions (2) has a solution, 

i £Lj~2n+l-k 

U9) u, = g 2 ‘-’U - t »)! (1 - *)(3 - k) . . . (2j - 2n + 1 - k) 

. Cl 

^ 2 ' j!(l + k)(3 + k) ... (2j — 1 + k)’ 


I 


I 
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provided k 9 * — 1, — 3, ...» — (2; — 1), which is not unique since the a n 
are arbitrary. 

Weinstein has recently shown (see for instance (3)), that for odd negative 
integral values of k, solutions of (18) satisfying certain differentiability con- 
ditions exist only if the initial value function is polyharmonic of order (1— k)/2 
and that, in this case, the addition to the solution of any function of the 
type 

t l ~ k u 2 ~ k (x i, . . . x n , t), 


(u 2 ~* denotes a solution of (18) with k replaced by 2 — k) which vanishes 
with its t derivative at t = 0, gives another solution of the problem. We may 
illustrate this result of Weinstein by applying Theorem 1 to (18) with boundary 
conditions (16). As a first solution we may take a n — 0, n = 0, 1, 2, . . . and 
obtain 

(20) u(x, y, z, t ) = e z cos y^jcz* + ( z ‘ + 6xz 2 ) j-q— j- 

+ ( ° 2 + 3v) (l -f- &)(3 k) + (l+*)(3 + *)(5 + fe) 



The solution (20) is invalid for k = — 1, —3, —5, but holds for k = — 7, 
— 9, . . . etc. Since xe? cosy- z* is polyharmonic of order ^{1 — ( — 7)} =4, 
this illustrates the result of Weinstein quoted above. As a further illustration 
of Weinstein’s result we may let a 0 9* 0 in (19) and augment the solution 
(20) by 


A-Jt 


(21) a 0 Y ' —k 6 X COS ^ xz 4 




r 


[1 + (2 - k)] 

A 


-f (6z 2 + 3x) 


+ 


t 


[1 + (2 - ^)][3 + (2 - k)] 


St 


[1 + (2 - *)][3 + (2 - k)]{5 + (2 - ^)] 


- £)il 


which is clearly an arbitrary constant multiple of by a solution of (18) 
with k replaced by (2 — k) which vanishes with its t derivative at t = 0 for 
all negative k. 

Where condition (3) does not hold, the above methods may lead us to solu- 
tions in infinite series form. As an illustration we may consider the vibrating 
string problem 


(22) u xx — a~ 2 u tt — 0, u(x, 0) = P(x), u t (x, 0) = 0, w(0, /) = u(L, t) = 0, 
where the function P(x) has the Fourier expansion 

(23) P(x) = E 4. sin ^ . 

l -L 

We then obtain 
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(24) 
and 

(25) 

Taking 

(26) 




<p- i; (-D'*,(-f) !,sin ” 
w 2 ' 


TTX 


Uj = 


( 2 j)\ ’ 


V"' J ' ni rX ( a 0 j 

*- SS (- 1 ) Ht; sm Tw 


J = 0,1,2 


we have a formal solution. If the order of summation be interchanged we 
obtain 


(27) 


, . htx nirat 

u = ) , o n sin — r— cos — 

n~l 


which is the usual form of the solution of this problem. The one dimensional 
heat flow problem may also be solved by this method but the applicability 
of the method to other problems, because of convergence questions, is a 
matter requiring further study. 
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IMPLICATIONS OF HADAMARD’S CONDITIONS 
FOR ELASTIC STABILITY WITH RESPECT TO 
UNIQUENESS THEOREMS 

J. L. ERICKSEN and R. A. TOUPIN 

Introduction. The purpose of this paper is to discuss implications of 
Hadamard’s condition for elastic stability ( 2 , §269) with respect to uniqueness 
of solutions of boundary value problems in the theory of small deformations 
superimposed on large. We show that a slightly refined form of his condition 
implies a uniqueness theorem for displacement boundary value problems. 
We construct a counter-example showing that his condition does not imply 
uniqueness of solutions for one type of stress boundary value problem. Hada- 
mard ( 2 , Ch. VI) showed that his condition implies the reality of all possible 
velocities of propagation of acceleration waves. To our knowledge, this is the 
only other known consequence of his condition. 

Truesdell (8) has focused attention on the question of what conditions 
should be imposed on the strain energy to exclude physically unacceptable 
behavior. We are indebted to him for discussing this problem with us, thereby 
stimulating our interest in the topics considered here, and for his constructive 
criticisms of our work. 

It is sufficient for our purposes to require that all vector fields considered 
be of class C 2 at all points of the undeformed body, which points constitute a 
regular region of space 9?, as defined in (3). 

1. Elasticity theory. The theory of elasticity with which we are con- 
cerned is based on the existence of a strain energy per unit of undeformed 
volume 2, which is a function of displacement gradients U a ,p. Here U a are the 
components of the displacement vector referred to a material 1 coordinate 
system and the comma denotes covariant differentiation with respect to these 
coordinates. We assume 2 is of class C 3 for all U a j, that there are no constraints 
on the deformation, and that inertial and body forces vanish. The basic 
equations may then be written 

(1) (32 /dU a , p ), 0 = 0, 

as was shown by Kirchhoff ( 4 ). 

To obtain the equations of the theory of small deformations superimposed 
on (possibly) large deformations, one writes U a = V a -f- IT", linearizes Eq. 
(1) with respect to the W a , and assumes that the displacement V a satisfies (1). 
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‘The adjective “Lagrangian” is used more frequently. For reasons pointed out in (7, §14), 
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We thus have 
( 2 ) 


d2/dU a ,(i = dZ/dV*.i,+ W\id 2 l/dV a , fi dVy,» t 

- {d2/dV*j)j, = 0, 

{Wr tt d*2/dWjdV*.i)j = 0. 


Alternative formulations are given in (1) and (6, §55). Here V a is regarded 
as known, whereas W a is to be determined by the linear equations (2) and 
appropriate boundary conditions. A displacement boundary value problem is 
set by specifying W a on the bounding surface 0 of 9?. To show that two dis- 
placement vectors satisfying these same conditions are identical, it suffices, 
because of linearity, to show that any W a which satisfies (2) and vanishes on 
0 must vanish in 9?. For equations (1), one type of stress boundary value 
problem is set by specifying N^dH,/ dU a ,p on 0, N a being a unit vector normal 
to 0. In the above linearized theory, this leads to a problem in which the 
quantities 

(3) T a = A^.,d 2 2/d V a ,ed 


are specified on 0. 

If one sets V*j = 0 in (2), W a becomes a small deformation about the 
state of zero deformation. It is customary to assume that dZ/dV" j = 0 when 
V a t fi = 0. We make no use of this assumption. If one places certain restrictions 
on d 2 2/dV a j dV y j evaluated at V a j = 0, one obtains the usual equations 
of the classical linear theory of elasticity. For example, for isotropic materials, 
one takes 

(4) 3 1 2/ar. a ar. 1 |r».,-o = MX + + G’U,) 

where A and n are the Lam6 constants and G a & is the metric tensor. The bound- 
ary data (3), which become 


(5) T a = A W'jN. + n(W a3 + Wf.a) N* 

when (4) holds, is the data ordinarily prescribed in stress boundary value 
problems in the linear theory. The Kirchhoff uniqueness proof, valid when 
(3A + 2 m) M > 0, establishes that T a = 0 on 0 implies that W a ,t + “ 0 

in 9?. In other words, the boundary data (5) determines the displacement field 
W a to within an infinitesimal rigid motion. It seems reasonable to expect 
that this uniqueness theorem will hold for the small deformation when 
V a j 0 if suitable restrictions are placed on 2 and V a . What constitutes 
a set of “suitable restrictions” on 2 is, according to Truesdell (8), the main 
open problem in the theory of finite elastic deformations. We shall show 
that the desired uniqueness does not follow from Hadamard’s stability 
condition. 


i 


l 


2. Elastic stability. Hadamard (2, §269) calls a deformation stable when- 
ever the second variation in total strain energy is non-negative for all variations 
■ in U a which vanish on 0. Formally, stability means that 

I 

I 
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(6) # - 5 2 f^V> 0 

whenever 8U a = 8 2 U a = 0 on 0, dV being the volume element. We have 
= 4>i + <£ 2 > where 

(7) «>. = j^'ir. t as/au".4v, 

*, = jj™ iU °e !,U ''‘ dl 'Z/dU’.calf.> dv - 

From (1), (6) and (7), 

4>i = j^(8 t irdX/dir J ). lt dV = jLfi’tTdS /dU*fidS fi = 0 
where dS, j is the vector element of area. Similarly, from (6) and (7), 

( 8 ) $2 = - J^8U a (8U y . s d 2 2/dU a ,( l dir ii ). li dV. 

Thus (6) can be replaced by 

$2 > 0 wherever 8U a — 0 on 0, 

$2 being given by (7) or (8). An analysis made by Kelvin (5) suggests that 
it is desirable to distinguish neutral or labile stability, for which <f> 2 = 0 
for some 8U a & 0, from ordinary stability, for which <J> 2 = 0 implies 8U a = 0, 
and we find it essential for our purposes to make this distinction. Henceforth, 
“stability” means ordinary stability, neutral stability being excluded. There 
was no reason for Hadamard to make this distinction since the results which 
he obtained are insensitive to it. 

3. Uniqueness. We begin by proving a uniqueness theorem for displace- 
ment boundary value problems. 

Theorem 1. In the theory of small deformations superimposed on large, 
if the large deformation is stable , the displacement boundary value problem for the 
small deformation has at most one solution. 

Proof. Let W a be any solution of (2) such that W a = 0 on 0. Multiplying 
the last of equations (2) by W a , summing on a, and integrating the result 
over we obtain 

V^^id^/dV^dV^t),^ = 0 . 

From this and (8), we see that $ 2 , evaluated for U a = V a and 8U a = W a , 
vanishes. If V a is stable, 4> 2 = 0 implies W a = 0. Thus, if a solution exists 
for a given displacement boundary value problem, it is unique. 
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We now proceed to determine necessary and sufficient conditions for the 
stability of the state of zero deformation of isotropic materials. In this case 
(4) holds and we obtain from (7) with U a ,p = 0, 

*2 = Jj R [x(«c/%) ! + n(&ir. s &u , .„ + sir- f su..,)w. 


Using the fact that 8U a = 0 on <0, we have 

0 = - SU e SU". t ]dS. = J] ISVlV*., - SU’tlTA^V 

« § - su’.jir.Mv. 

Also, 

8U a, ^8U a j j = 8U a t fi8U^ t a 4 " 


where 2o> a ,fi = U a ,» — Up, a . Using these relations, we obtain 
(9) *2 = (A + 2n) Jgj (tU m ..)'dV + 2fx 

a result due to Kelvin (5). Since each integral is non-negative, we have stab- 
ility, or at least neutral stability, of zero deformation, so long as X 4* 2u > 0 
and n > 0. A slightly sharper result is easily obtained. 


Lemma. For stability of the state of zero deformation of an isotropic elastic 
material , it is necessary and sufficient that X 4" 2/u > 0 and n > 0. 

Proof of sufficiency. If X 4- 2/x > 0 and n > 0, it is clear from (9) that 
$2 > 0, the equality holding if and only if 8U*, a = u a p = 0. These conditions 
imply that 8U„ = <£,«, where <f> is harmonic. Since <f>, a = 0 on @ and 0 is 
harmonic in 9?, <t>, a = 0. Hence 4> 2 > 0, unless 8U a = 0. 

Proof of necessity. To show that 3> 2 > 0 implies X 4* 2\i > 0, it suffices 
to construct functions 8U a such that 8U a = 0 on <3, o) a p s 0 in 9?, 8U a , a P* 0, 
as is clear from (9). One can take 8U a = t.a, where \J/ is any function, not a 
constant, whose gradient vanishes on ©. For example if, (£ C 9? is a sphere 
of radius r 0 > 0, we may take ^ = 0 in 5R — S, ^ = (r — r 0 ) 4 in (£, where r 
denotes the distance measured from the center of (S. Similarly, to show 
that $2 > 0 implies u > 0, one need only construct 8U a with 8U a — 0 on @, 
8U a , a = 0, 8U a & 0, and use (9). Such variations are easily constructed. 

THEOREM 2. For the stress boundary value problem (3), stability of the de- 
formation V a does not imply that the displacements W a will be determined to 
within an infinitesimal rigid motion. 

Proof. It suffices to establish that uniqueness does not follow from stability 
in the special case when V a j — 0 and (4) holds. By the lemma, we have 
stability if 3X -f 2/x = 0, u > 0. It follows, using (4), that (2) is satisfied by 
any W m such that W a j = a 8 a p, where a is an arbitrary constant. For any 
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such displacement, (5), with 3X -f- 2 m = 0, gives T a — 0 for arbitrary N*. 
If a ^ 0, the displacement considered above is not an infinitesimal rigid 
motion, whence the theorem follows. 

This theorem indicates that the definition of stability used here leads to 
results in disagreement with the intuitive notion, expounded by many writers 
in stability, that such non-uniqueness should be associated with instability. 
This might be regarded as an indication that it would be desirable to introduce 
further criteria to enable one to refine further the classification of types of 
stability used here. 

Theorem 3. Neutral stability of the deformation V a does not imply uniqueness 
of solutions to displacement boundary value problems in the theory of small 
deformations superimposed on large. 

Proof. Again it suffices to establish the theorem in the special case when 
(4) holds with X + 2 m = 0, u > 0. From (9) and the lemma, we then have 
neutral stability, but not stability. From the proof of the lemma, we can 
construct functions W a = xfr* such that W a = 0 on ©, W a & 0 in 9?. It follows 
easily, using (4), that when X -f- 2 m = 0, any such displacement satisfies (2). 
Since W a = 0 is another solution satisfying the same boundary conditions, we 
do not have uniqueness. 

Theorems 1 and 3 illustrate the importance of distinguishing between 
ordinary and neutral stability. As is pointed out by Whittaker (9, pp. 145- 
148), the case X + 2 m = 0, m > 0 is of some historical interest, having been 
considered as an aether theory. 
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A GENERALIZED AVERAGING OPERATOR 



D. B. SUMNER 

1. Introduction. The averaging operator V/(z) = §[/(z -f h) + /(z)] has 
an extensive literature, the most detailed account being that of Nbrlund 
( 4 ). In discussing solutions of the functional relation 

(1.1) V/(z) = *(b), 

he defines a “principal solution” ( 4 , p. 41) by means of a summability process, 
and later, working in terms of complex numbers, he obtains ( 4 , p. 70) a 
principal solution of (1.1) by means of a contour integral. He distinguishes his 
principal solution from other solutions, by showing that it is continuous at 
h — 0. His work includes a detailed account of the polynomial solutions 
of 

(1.2) V/(z) = z* , 

the Euler polynomials with assigned values at z = Milne-Thomson (3, 
pp. 519~521) gives an account of generalized Euler numbers arising from the 
operator V w , ( N a positive integer) and of the generalized Euler numbers. 

In this paper the ideas of Milne-Thomson are taken a step further. The 
operator V x is defined for all real X, and is shown to be applicable to a wide 
class of functions. Polynomials corresponding to the generalized Euler poly- 
nomials of Milne-Thomson and a sequence of numbers corresponding to 
Norlund’s C-numbers ( 4 , p. 27) are defined and some of their more important 
properties established. The inverse operator V~ x is defined, and is shown to 
invert the operation V x and to give a unique solution in terms of the functions 
to which V x is applicable. 

2. Generalized power of the averaging operator. The averaging (or 
mean) operator is defined for span h by 

(2.1) V/(z) = *[f(z + *) +/(z)J, 
and its positive integer powers by 

(2.2) V“m = VV'-'/W = + hp)/2". 

To define V x /(z), where X is related to the positive integer N by 

(2.3) N - 1 < X < N, 
we use the formal relation 
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and write 


V/(z) = |(1 4* exp hD).f(z), 


x = (1 + exp hD)" +1 
V 2 X (1 + exp hDf ’ 


H = N + 1 - X. 


The operation in the numerator can be expressed by means of (2.2); and to 
obtain a representation of the operation in the denominator, we use the fact 
that 

1 1_ C e+io ° exp (~tw)dw 

(1 4- exp /)“ ~ 2 71 -i Jc-iao £(<*> w) 

where t is real, a is positive, 0 < c < a and 

(2.4) E(a, w) = T(a)/T(w) T(a — w). 


Using the abbreviation 


X for X 


e+ too 
-ia o 


t 


we then have formally 


(2.5) 


?X l (N+i\ C f(z + ph - hw)dw 
V V P /Je 2ri E(n, w) 2 X ’ 


on using the shift operation exp ( kD ) ./(z) = /(z 4- k). We take (2.5) as the 
definition of V x /(z), if X satisfies (2.3), span h is positive or negative and the 
integrals exist. 

Although less restrictive assumptions as to the nature of /(z) would be 
sufficient to ensure the existence of the integrals in (2.5), we shall assume 
throughout that 


(2.6) /(z) is an entire function of exponential order k, nh < r. 


The following proposition is then an easy consequence of (2.6) and the 
fact that 

|r(e 4- iv) T(u — c — w)| ~ .4 exp( — tt|i>|) . (|t/| —*• a>): 


if 4>(z, h) is the function defined by (2.5) and f{z) satisfies (2.6), then 4>(z, h) 
is an entire function of exponential order k (in z) and 

(2.7) lim <f>(z, h) = f(z). 

ft-,0 

Thus <t>(z, h) has the property (2.7) which was noted by Norlund (4, p. 46) 
as being characteristic of his principal solution of the functional equation 
V/(z) = <f>(z). It must be observed, however, that there do exist entire functions 
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in z, for example, cos (irz/h) which satisfy neither (2.6) nor (2.7), but for which 
the operation V x is defined when X is a positive integer but not otherwise. 

In the particular case when X = N, the definition (2.5) gives for /(z) 
satisfying (2.6), 


V7(*) = 2 


-"V ( C fl* + h(P-™)]+f[z + h(P + l 
oKpJJc 2riE(\,w) 


~ «Q] 


0 \ p / ( sin mv ) 


dw 


= 2 




which is the value given in (2.2). 

We may confine ourselves to cases where h > 0 by reason of the following 
extension property: if 4>{z, h) — V x f(z), then 


(2.8) 4>(z + h\, —h)= 0(z, h ). 

For reversing the summation, and making the change of variable w — p — I-, 
we have 


<f>(z -f- h\, — h) 


r>-x n+i(n + l\ C }{z + hq — h£)dj 
o V q ) J M -e 2 TiE(n,n — £) 
9 -x y* jv+i/ iV 4* A T /(z 4- 

V \ ? /J e 2 «E(m,*) ’ 


by Cauchy’s theorem, since 0 < c < /*, 0 < — c < h, and E{n, n — £) = 

■E(m» £)• 


3. The exponential property of V\ We prove that 

(3.1) V“V*/(z) = V a+0 f(z) 


when a, /3 are positive. On account of (2.2) it is sufficient to give details for 
the cases 

(3.2) 0 < a + 0 < 1, 

(3.3) 1 < a + 0 < 2. 


For the proof in the case (3.2) write a + (3 = y. Then 


(3.4) 

and for 0 < a < 2 


£(2 

a, 0 < b < 2 - (8, 


7, w) 


(0 < c < 2 - 7); 


v*v7(*) 

(3.5) 


/ 2 V 2\ f ^5 f f[z + h{p + q — s — w)]dw 

oKpAgJ Ja 2 y 2 ri E(2 - a, s)J b 2riE{2 - 0, w) 

y * / 2\ f ds C F(s + w) dw 

~o \nj Ja 2 7 2ir i E( 2 - a, 5 ) J 6 2ri E(2 - 0, w) ’ 
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where F({j) =■ f[z + h(n — £)] + 2 f[z -f h(n 4- 1 — £)] + f[z + h(n + 2 — {)]. 
By Cauchy’s theorem we may take 0 < a < b\ then 

f 4- w) dw _ C Fix) dX 

J* 2irx E( 2 — P, w) Jo 27 rx E ( 2 — 0, £ — s) ' 


Hypothesis (2.6) guarantees the absolute convergence of the integrals in 
(3.5), so that 


V a V*/(2) 


-r(A ClWJi 

o \»/ J& 2t i 


40/. 


f r(j) r(2 - g - $ + S) r(2 - a - s) r(g 
J a 2 4 * * 7 2 t ri r (2 - a) T (2 - 0 ) 

r(t) r(4 - 7 - g) F(g) # 

2^2 7rx T(4 — 7 ) 


s) ds 


by Barnes’s Lemma (1, p. 155). Abbreviating this expression as 

(£)[/, + 2 /, + /,] 


we let the lines of integration in / 2 and 1 3 be changed to b + 1 and 6+ 2 
respectively; and since the only positive poles of the integrand are at 
{ = 4- 7,5-%... and since 4 — 7 > 3, no poles lie in the strip b < R(£) < 
b - b 2. Cauchy’s theorem may then be applied to give 

Ii 4- 2/j + I 3 — £ 

[r(£)r(4-7-f)+2r($+i)r(3-7-S)+r(£+2)r(2-7-S)]/fz+ft(ft-g)M£ 

2 ri r(4 - 7 ) 

f r(g) r (2 - 7 - £)/[« + a(» - g)] </g 
2 xx r (2 - 7 ) 

Thus we have from (3.4) 


V a V*/(*) = V“+*/(*). 

In the case (3.3) 


- ,4.00 /. /*$ 5 ss 

■40/ 


4 - q — s — w)]dw 
1 ri) E( 2 — 0, w) 


<fc 

2irx E{2 - a, s ) 

\f[z 4- h(n — s — w)] 4- f[z 4- h{n 4- 1 — s — w)] \dw 

2 7 rx E{2 — 0, w) 

and the previous argument may then be used to establish the result. 


/. 
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4. The numbers g£ and the polynomials g*(z). We digress here to 
define certain fundamental numbers and polynomials associated with V x . 
Let 

op X ,K 

(41) (l + exp/) s= (I'lOr). 

(4-2) g\{z) = £(*)**""*«• 

On writing G(t) = 2 X (1 + exp t)~ x , we obtain 


[1 +exp(-/)]G'(0 + \G(t) = 0, 

E (j)[l + exp(-0]“ , G < * +1 "“(0 + XG"(0 = 0, 

from which, on setting t = 0, and using the definition g\ — G (k) ( 0), we have 
the recurrence relations 


or 

(4.3) 


E + 2gi+. + Xgi = 0 


) P gp+l + 2g n+1 4- Xg„ = 0, 

go = 1. 


n > 0, 


It is an easy calculation to establish for the polynomials g* x (z) the generating 
relation 


2 A exp (zt) = t k g x k (z) 
(1 + exp t) x V k\ 


(4.4) 


The numbers g\ have the following explicit value in terms of the Stirling 
numbers: 

(4.5) g\ = (-)‘E y i r(x + p)/r(x) 2 ”. 


p-i 


For we have 


l-*0 




gi = lim G <*‘(0 = Iim f 

(->0 2 7Tt «/ g 


£(X, w) 


- lim (-)^E 5A X 2 + <>) f • 

i T(X) 2irt J c 2s(X + P, X — w) 


l -»0 


where 



x -*0 


AV 

/>! 


are the Stirling numbers of the second kind (2, p. 134), and use is made of 
the identity 

w* = E y lr(w + p)/r(,w). 

p — 1 
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Thus, using the notation ( X)„ = r(X + p)/T(\), 

si - i™ (-)V£ jr ^ryy+r = (-)*2 -£ 

ho p""i t ^ / p**» 1 

We prove next that 

(4.6) W^z/A) = z*. 

Writing £ = z/h, we have from (4.4), 

X t(J+n — ic) co .k 

(1 _}_ g‘) x_1 = 2 (g*(£ + n — w>) + g*+i(£ + n + 1 — «>)]. 

Multiplying throughout by 

(^)/2*.2\E(AT+ 1 - X,w), 


summing from « = 0 to N, and integrating with respect to w along the line 
R(w) = c makes the right-hand side equal to 


t vvi©. 


and the left-hand side equal to 

e i( Y' ( »* f exp (—tw)dw 

~7f =l h \n) e J e 2iri E(N + 1 — X, 


(1 + 


w) 


e ,£ (l + e‘) 


t\N 


a + «r(i + «■) 


Thus 


t k 


tsN+i-\ = e 


a 




and the result stated follows by comparing coefficients. 

We note here that the function h k g\(z/h ) has the property (2.7). 


5. The inverse operator. A definition for negative powers of V is obtained 
from the observation that formally 


V“ x <A(z) 

(5.1) 


2 X 2 X (* 
(1 + exp hDf 0(2) = 2riJ e 

2 X C <t>(z — hw)dw 

2iriJ e E(\,w) 


exp (-hDw)dw 
E(\, w) 


• </>(s) 


0 < c < X. 


We take (5.1) as the definition of V~ x <f>(z), and as before assume that <f>(z) is 
of exponential order k, kU < tt, as a sufficient condition for assuring the 
existence of the integral in (5.1). This definition is valid for any real h, but 
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we are justified in confining ourselves to the case A > 0 by the following 
extension property 

(5.2) iff(z) = V -X <£(z)» thenfiz — h\, — h ) = /(z, A). 

For on setting w = X — £, and observing that 

0 < ^(£) = X - c < X, 
we may apply Cauchy’s theorem to see that 

2 -Vd f V 

J c 2 xj£(X, w) 

0(g — /?£)(/£ 


(0 < c < X) 


-x 

-J. 


x-.2 «£(X,X-{) 
4>(s — hOdt 


' e 2ri E(\, {) 2 

The definition (5.1) is easily applied in special cases. Since 


(5.3) V“ x tf(g) = 


X 




2xf J e 2s(X, w)^To ra! 




= * w + hi X 


dw 


(~h) 


E(\,w)t!i ml 0 m r(w) 


= + 2 X E 4^ *<"’(*) E n 


rnTl W! 


— 1 


X 


r(w + />) r(x — w)<fa> 

2 ti T(p + X) 


= +w + e 4^ e ^ 


m -1 nil 

CO I m 




oo im oo r*n . (m) / \ 

= + E K *'”’(») A - E 

m— 1 "*• 


ra—0 


m! 


by (4.5) when the series converge. Thus when <£(2) = 2*, 

(5.4) V~ x 0 (*) = i + E ( k ) U'sT'gi • = E ( k ) h m i 

1 \m / 0 \m / 


.. k—m \ 
% &m 


Other simple cases would be 


(5.5) 

(5.6) 


V'V = 2V/(1 + e n ) 


,X z 


h\X 


V X sin 2 


“ sin ( z - 1)/ ( cos l) • 


That the operation (5.1) does indeed invert V x /(g) is shown in the theorem: 

Theorem. If 0(2) = 0(exp/c|2|), (JgJ — > »), K h < x, and F(z) is defined 
by (5.1), then F(z) is of exponential order k, and 


(5.7) 


V x F(g) = 0(2). 
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That F(z) = 0(exp jcjz|) may be proved in a manner similar to that by which 
(2.7) was established. To prove (5.7), letOCaCiV-l-l — X, 0<6<X, 
and consider 


^y~'(n+i\ f F(z + ph-hs)ds 

VFW " 2 h \ P /J.2«£(/+l -\,s) 




ds 


2 xiE(N + 1 - X, s) 

J* <ft[z 4- ph — fr(s + w)]dti; 


r ds r <ft(z + ph — ht)di 

TV ^ / J a 2iriE(N+ 1 - X,s) J a 


2xi E{\, w) 

<.+» 2Tt'J5(X t $ — j) 


Since a -+* b < a + X, and the poles of the inner integrand lie on the lines 
(£) — cl, a, — 1, . . . , (£) — a X, fl-j-X-J-1,.... 


Cauchy’s theorem may be applied to give 


V x F(z) 



ds 

2 t riE(N+ 1 - X,s) 



<t>(z ~b ph — h£)d% 
2x iE(\,Z-s) ' 


The exponential order of <£(z) and the order properties on vertical lines of 
the T-function (5, p. 151), are sufficient to establish the absolute convergence 
of this iterated integral, and Fubini’s theorem may be applied to give 


V x F(z) - 


y * 1 / N + l\ r <t>(z -j- ph — h£)dl- 

o \ p / J* 2 xi 

f r(s) r(X - £ + s) T(N + 1 - X - s) r(£ - s)ds 

J a 2 xi r(x) r(N + i - x) 

y' 1 (N + A f <t>(z 4~ Ph — h£)d% 

o \ p / 2 xi 

r r(s) r(X - £ + s) T(N + 1 - x - s) r(£ - s)ds 

J L . 2xi r(x) r(w + 1 - x) 


by Cauchy’s theorem, where the contour L a is obtained by deforming R(s)=a 
in such a way that the poles of T(N + 1 — X — s) T(£ — s) lie to the right 
of L a , while the poles of T(s) r(X — £ s) lie to the left. Then by Barnes’s 
Lemma (1, p. 155), 


V x F(z) 


+ A r *\z - ha - pm A 
o \ p /J 6 2wiE(N+l,i) ~ 


+ 3. 


To evaluate 

a _ v ( n + A r - p)]d * 

Y\ p / 2«E(A r + 1.*)’ 
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Cauchy’s theorem may be applied, since 0 < p < N, to give 

, f/jj+Af - ha - pm 

•VV P / 2riE(N + 1,£) 

_ rp(N +l\ C 0(z - ft£)d£ 

“TX p ) J l 2wiE(N+ l,p + £) 

f »(s - m -f ftr + A r(P + £) r(jv + 1 - p - £) , 
~ J» 2ri V\ P / r(iV+ 1) * 

f pjz - m -f (-y - 1 ),({),. . 

J, 2 «£(Ar + 1,0 V pltt-iV), { 

where 

-f (-1V - !),(£), _ s_f —AT — 1, £. .1 _ (-)" +1 (£W 
•V p! (£ - AT), '1 £ - N • *J (£ — N)„ +l 

(-JVW. r(AT+ 1 +£) T(-£) 

(£ - n) k+1 r(£) r(iv+ l - £) 

T(JV + 1 +£)(-£) 

r(£) r(Af + l - £)' 

Thus A 4- B 


r(£) r(AT+l-£) 4 »fg— A(£— iV— 1)1 - r(AT+l+£) r(-£) r(»-A£) 


X 2 ri T(N + 1) 

U r\ r(iV -f 1 + w) T(—w) <j>(z — hw)dw 
t-N-i JJ 2xir(N+l) 

; oj = 

which completes the proof. 




Res 


/ T(N + 1 + w) T(—w) <t>(z — hw) 
l T{N+1) 


6. Remarks. It is well known that the functional equation 

(6.1) V*/(s) = *00, (N = 1,2,.. .) 

has solutions other than that given by (5.1). For example, if p{z) has the 
property 

(6.2) p(z + h) + p{z) = 0, 
it is a solution of the homogeneous equation 

V7W = 0; 

and if it is added to the solution of (6.1) given by (5.1), the resulting function 
is still a solution of (6.1). It does not, however, have the property (2.7), since 
for example p(z) could be s\n{x z/h) or cos(ir z/h). Moreover it need not 
satisfy requirement (2.6), since 

cos (x z/h) = 0[exp (x\y\/h)], (|y| -+ ®), 
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and V x need not then be defined except when X = 1,2, ... . These facts 
suggest the possible existence of a set of eigenvalues X = 1,2,..., with a 
family of eigenfunctions corresponding to each eigenvalue for the operator 
V\ 
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NOTE ON A STIELTJES TYPE OF INVERSION 


PASQUALE PORCELLI 


If F(z) is an analytic function for z $ [— <», —1], g(t) of bounded variation 
and real valued for 0 < t < 1 and 

F(z) = f (1 + zt)~ l dg(t), 

Jo 


then the Stieltjes type of inversion between F(z ) and g(t) (cf. 1, p. 339, 
Theorem 7a) is 


■gvf i 'A- 7 * »> - " ,+> t ll '~ > - “ 


-) 


where 0 < v < u < 1, I m F(z) is the imaginary part of F(z) and z = — 

A second type of inversion between F(z) and g(t) was obtained by Widder 
(1, p. 340, Theorem 7b) under the additional hypothesis that g{t) is an 
absolutely continuous function. In the following theorem we shall establish 
an inversion between F(z) and the right- and left-hand derivatives of g(t) 
without the restriction that g(t) be an integral. 


Theorem. Let F{z) be analytic for z $ [— — 1], g(t) real valued and of 

bounded variation on [0, 1] and 


(1) 

then 


f{z)~ fa + *r l d g {t), 

Jo 


(/) 


for any t in (0, 1) at which the right- and left-hand derivatives g /+ (t) and g'~(t) 
exist. 


Proof. Let us suppose that g(0) = 0, 0 < t 0 < 1 and that g' + (t 0 ) and 
g'-(to) exist. If we set 

R(t) = [(t 0 - ty + (/ 0 y0 2 ] -1 

and s = toir~ l , then from (1) we have 

(2) ^ / » f V : r + »y) = ffmdg(t) + s y f 

Received December 20, 1955 and in revised form March 9, 1956. Work on this paper was 
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PASQUALE PORCELLI 


In the first integral of this expression we can replace g(t) by [g(to) + g'~(/o) 
(t — Jo) + h(t)(t — J 0 )], where h{t ) is continuous at Jo and h(t 0 ) = 0, so 
that 

j t (0 n to n to 

tR{t)dg{t) = syg'~(to) I tR{t)dt + sy I tR(t)dh(t)(t — to). 

0 «/ 0 J 0 

The first term on the right side of this equation can be integrated directly and 
we can easily verify that it approaches 2 ~ l g'~(to) as y approaches 0-K Upon 
using the integration by parts formula, the second term reduces to 

- sy °h(t)(t - t 0 )[R(t) + tR'(t)]dt. 

If J denotes the value of the last expression, then 

|/| < ^J o <0 \h(t)\R(t)dt. 

For each e > 0, there exists y > 0 such that |A(/)| < \ttl for t 0 — t < y, so 
that 

^ P | h{t)\R{t)dt < ^ f R(t)dt < e 

ir J to— r w o 

for y > 0. Since h(t) is a bounded function, there exists y' > 0 such that, 
fory < y\ 

~ (‘^{hWlRiOdt < €. 

IT Jo 

In order to treat the second integral appearing in (2) we replace g(t) by 
[g(*o) + g' + (to)(t — to) + k{t){t — Jo)] 

and proceed as above. However, in this case the integration by parts formula 
yields the additional term 

*“ l y[*(D(i-i.)/4(i+y)] 

which approaches zero with y. This completes the proof of the Theorem. 

Remark added in the revision. I am indebted to the referee for suggesting 
the revised form of the Theorem. Also, as he points out, the Theorem is valid 
if we replace the interval of integration [0, 1] by the ray [0, ®] and restrict 
z so that z $ [— 00 , 0]. 
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